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The Tap-Leakage Algorithm: An Algorithm for 
the Stable Operation of a Digitally 
Implemented, Fractionally Spaced 

Adaptive Equalizer 


By R. D. GITLIN, H. C. MEADORS, JR., and S. B. WEINSTEIN 


(Manuscript received January 19, 1982) 


A fractionally spaced equalizer is a nonrecursive adaptive filter 
whose tap weights are spaced a fraction of a symbol interval apart. 
Such an equalizer can significantly enhance modem performance in 
the presence of severe linear distortion, when compared with a 
conventional synchronous equalizer whose taps are spaced a symbol 
interval apart. However, a digitally implemented, fractionally spaced 
equalizer generally will exhibit long-term instability when the con- 
ventional tap-adjustment algorithm is used. This occurs because, in 
contrast to the synchronous equalizer, a fractionally spaced equalizer 
generally will have many sets of tap values, which result in nearly 
equal values of mean-squared error (mse). Some of these tap set- 
tings—which invariably will be attained because of biases in the 
digital tap-updating circuitry—are large enough to cause register 
overflows and consequent performance deterioration. In this paper 
we report how a simple modification in the tap-adjustment algorithm 
provides a solution to the above problem. The modified tap-adjust- 
ment algorithm prevents the buildup of large coefficient values by 
systematically “leaking” or decreasing the magnitudes of all the 
equalizer tap weights. For an experimental modem operating at 9.6 
kb/s, it has been demonstrated that the tap-leakage adjustment 
algorithm prevents the accumulation of large equalizer tap values, 
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while permitting the full performance gain of a fractionally spaced 
equalizer to be realized. 


l. INTRODUCTION 


Fractionally spaced equalizers (FSEs), which are nonrecursive, 
tapped, delay-line adaptive filters, are currently receiving much atten- 
tion’ because of the significant performance advantage they provide 
when compared with a conventional synchronous equalizer. Since an 
FSE has the capability to adaptively realize the optimum linear re- 
ceiver, it can greatly improve the performance of a modem in the 
presence of severe linear distortion. More specifically, significant per- 
formance improvements have been observed owing to the ability of an 
FSE to compensate effectively for delay distortion at the limits of 
private-line, voice-grade channel conditioning.® However, in laboratory 
experiments with a digitally implemented FsE it was noticed that after 
an extended period of operation some of the equalizer tap weights 
would invariably become large, while the mean-squared error (mse) 
remained at a satisfactory level. The taps generally would become so 
large that one or more registers, which compute partial sums of the 
equalizer output, would overflow, and the modem performance was 
then substantially degraded. This phenomenon is a consequence of the 
fact that an FSE, in contrast to a conventional synchronous equalizer, 
generally has many sets of tap values that correspond to roughly the 
same mse. Included in the set of tap values that correspond to the 
_minimum mse are some tap coefficients of relatively large magnitude. 
These large tap values can be attained because of the cumulative effect 
of noise or any bias in the digital circuitry that performs the equalizer 
updating. Even though the value of the mse is satisfactory, some of 
these tap values will be large enough to cause occasional overflow of 
the partial sums computed to form the equalizer output. The purpose 
of this paper is twofold: to elucidate why a fractionally spaced equalizer 
has so many apparently “good” sets of tap values, and to indicate how 
equalizer operation can be stabilized by simply modifying the conven- 
tional estimated-gradient tap-adjustment algorithm. 

The “almost unique” nature of the fractionally spaced equalizer 
coefficients is discussed in Section II. In Section III we describe the 
reasons for the occurrence of large tap values, and in Section IV a 
modified adjustment algorithm, dubbed the tap-leakage algorithm, is 
proposed to remedy the observed equalizer instability. The results of 
laboratory experiments are reported in Section V. 

Il. DOES A FRACTIONALLY SPACED EQUALIZER HAVE A UNIQUE 
OPTIMUM SETTING? 
2.1 Fractionally spaced equalizers 

To answer the question posed by the title of this section, we refer to 

the simplified baseband data transmission system shown in Fig. la. 
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Fig. 1—Simplified (baseband) pulse amplitude modulation (PAM) data transmission 
system incorporating an FSE. (a) Transmission system. (b) Spectrum of the transmitted 
pulse. (c) A fractionally spaced equalizer. 


For the purposes of exploring the phenomenon of large-tap buildup, 
this baseband model will suffice. Referring to the figure: {a,} are the 
discrete-valued multilevel data symbols, 1/T is the symbol rate, p(t) 
is the band-limited transmitter pulse (whose spectrum is shown in Fig. 
1b), f(t) is the channel impulse response, and v(t) is the additive 
background noise. Note that the receiving filter output, r(t), is sampled 
at the rate 1/7’, and the samples are then passed through the tapped 
delay-line equalizer (shown in Fig. 1c) having (2N + 1) delay elements 
spaced T’(<T) seconds apart and weighting coefficients {c,}. The FSE 
output 


N 
g(nT) = YX enr(nT- mT’), n=1,2,--- (1) 
m=—N 
is computed at the symbol rate and quantized (sliced) to provide the 
data decision, d,. The transmitted pulse spectrum, shown in Fig. 1b, 
generally will be band-limited to (1 + a)z/T radians/second where the 
rolloff factor, a, varies between 0 and 1. An FSE with tap spacing T”’ 
seconds will have a transfer function, Cr, (w), with period 27/T’, and, 
as shown in Fig. 2, if T’ < T/(1 + a), the transfer function of the FSE 
will span the entire spectral range of the transmitted signal. This 
enables the equalizer to exert complete control over the amplitude and 
delay distortion present in the region 0 < |w| < (1 + a)7/T. Conse- 
quently, the FSE can compensate for these distortions directly, rather 
than filtering the aliased (folded) spectrum, as is done by the conven- 
tional equalizer.® As an example of this feature, consider the ability of 
the FSE to compensate for delay distortion by synthesizing the phase 
characteristic conjugate to that of the received pulse. This operation 
will leave the noise power at the equalizer output unchanged from the 
received noise power. The conventional synchronous equalizer, be- 
cause it has a transfer function that has period 27/7, can only equalize 
the folded spectrum, i.e., the compensation characteristics on either 


: = -jwlT’ 
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“TRANSMITTER” 
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Fig. 2—Transfer function of a fractionally spaced equalizer. 
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side of 7/T are restricted to be the conjugate of one another. If the 
channel characteristics are such that the folded spectrum has a relative 
null at a particular frequency, then the noise at the output of the 
synchronous equalizer can be significantly enhanced and the perform- 
ance degraded proportionately. 

From Fig. 2 it should also be evident that when the noise becomes 
vanishingly small, there is legitimate concern as to what function(s) 
the equalizer will synthesize in the region, (1 + a)7/T < |w| < 27/T, 
where there is no signal energy. It is known’® that for an infinitely 
long equalizer the transfer characteristic that minimizes the mse is 

X *(w) 27 


OO SS lol=— 


2 
} x}o +13] + No 


I 

where the asterisk denotes the complex conjugate, X (w) is the transfer 
function of the received pulse,' x(t), presented to the equalizer input, 
and Np is the noise spectral density. As long as No # 0, the equalizer 
function is zero whenever the received signal has no power; however, 
as No — 0 the derivation of (2) is no longer valid, and, moreover, (2) 
approaches 0/0 in the region (1 + a)7/T < |w| < 7/T. Clearly, as the 
noise vanishes, an infinitely long FSE can synthesize the required 
channel characteristic in the region 0 < |w| < (1 + a)a/T, and an 
arbitrary—and nonunique—characteristic in the remaining frequency 
band. An interesting question, then, is what happens to the optimum 
tap setting for a finite length FSE as the noise becomes vanishingly 
small. 


(2) 








2.2 Uniqueness of solution for finite length FSE as the noise vanishes 


The equalized mse is defined as 
E = ({q(nT) — an]’) = (ex), (3) 


where the brackets denote the ensemble average with respect to the 
data symbols and the noise, and e,, is the equalizer output error, q(T) 
— Gn, at t = nT. The mse is readily evaluated as the quadratic form 


E = ¢'Ac — 2c’x + (a2), (4) 
where the prime denotes the transposed vector, c’ is the tap vector 
(c-v, +++, Co, **+, ¢w), X is the truncated impulse-response vector 
|x(NT"), «++, x(—NT"’)|, and A is the channel-correlation matrix. 
More specifically, the channel vector is given by 

X = (Gn¥n), (5) 


t Thus x(t) is the convolution of the transmitter, channel, and receiver filters. 
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where the received vector is given by r, = |r(nT + NT’), --- , r(nT), 
r(nT — NT’)|, and the A matrix is defined by 


A = (Yn¥n). (6) 


The kith element of the channel correlation matrix is given by 


ie.) 


An= YY x(mT — kT’)x(mT — IT’) + 084-1, (7) 
where o” is the noise variance and 6, is the Kronecker delta. Note that 
A is not a Toeplitz matrix, as it would be for a synchronous equalizer 
(T’ = T). If A is nonsingular then the optimum setting and the 
corresponding minimum mse are obtained from (4) by differentiation, 
and are given by 


Copp = Ax (8a) 
Eo = 1—x’A™x, (8b) 


where (a2) is taken to be unity. 

As seen from (7), the matrix A is the sum of two matrices, and as 
will be evident from the discussion that follows, the channel-dependent 
component of A is always positive semidefinite. Since the other com- 
ponent of the channel-correlation matrix, o’I, is positive definite, then 
A will also be positive definite, and we can conclude that when there 
is noise present, the optimum tap setting is unique. 

We now consider the situation as the noise becomes vanishingly 
small; clearly, the optimum tap setting will be unique if, and only if, A 
is nonsingular. A sufficient condition for A to be nonsingular is the 
nonvanishing of the quatratic form u’Au, for any nonzero test vector 
u with components [w;]. Let us consider in detail this quadratic form, 
which we write from (7) as: 

N 


wAu = > UmAmnlln 
m,n=—N 
N co 
= YY unr Y x(IT — nT’)x(IT — mT’) 
mn=—N l=—oo 
oo N 2 
= 3 MY Unx(IT - mr’) | = 0. (9) 
l=—-o0 | m=—N 


The above inequality establishes the positive semidefinite nature of 
the matrix A, and we see from (9) that u’Au can vanish only if* 


* The authors gratefully acknowledge discussions with J. E. Mazo that led to this 
development. 
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N 
YX unx(IT — mT’) = 0 2=0, +1, +2,---. (10) 


m=—N 


If we define the periodic Fourier transform 


N 
Ur(w) = Yo une’, |w|< (11) 
N 


m=— 


T 
Te 


then we can proceed further by noting that 


~ a ‘i 7 - dw 
Y umx(IT — mT’) = Y um | X (weet) SE 
mae m=—N s 20 
foe) N . d 
= [ »Y Ume —jomT Meyer 
_~ Lm=—-N 20 


-| Ur(o)X (oye? 22 
= 2a 


ae ng 
5 (2k "F Ur(o)X (oe Heit 22 
k 2a 


(2k— Va 


[E y Ur (. zo F] 


k2a . om AW 
‘ a —jolT 
x(o+ 7) ¢ On (12) 


The right-hand side of (12) is recognized as the sample, at ¢ = IT, of a 
function whose Fourier transform, Z.q(w), is contained in the brackets. 
If (12) is to be zero for every value of J, then it must be that the Fourier 
transform inside the integral vanishes completely, i.e., 


k2r h2a T 
Zeq(o) =X Ur (0+ S) x(o+ 4) -0, lol =a: (13) 


In Fig. 3a we show the situation when there is no excess bandwidth, 
and since the sum, (13), reduces to one term, the only way for 
Zeq(w) = 0 is for either X(w) = 0 or Ur-(w) = 0. Since this implies that 
Ur(w) = 0, it would violate the nonzero requirement on u. Thus, we 
can conclude for this case that A is positive definite. A similar sketch 
for the less than 100-percent excess bandwidth case is shown in Fig. 
3b, where it is noted that only the k = 0, +1 terms contribute to the 
sum, (13). However, in the nonrolloff region, |w| <= (1 — a)7/T, only 
the k = 0 term influences the sum. For channels that do not vanish 
over the entire nonrolloff region, it is clear that for Z.q(w) to vanish it 
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(a) (b) (c) 
Fig. 3—Sketches associated with eq. (13). (a) a = 0. (b) O<a<1.(c)a=1. 


is required that U7(w) vanish at least over the entire nonrolloff region. 
Since Ur-(w) is a finite-term Fourier series, it cannot vanish over an 
interval without vanishing everywhere, which in turn would again 
make u = 0. Note that if the channel vanished over a portion of the 
nonrolloff region, then since Z.q(w) is a finite-term Fourier series, its 
energy could not be totally concentrated in the region where there was 
no channel energy. Thus, the solution still would be unique. It is worth 
noting that in the extreme case of 100-percent excess bandwidth, Z.,q(w) 
can vanish. For example, in Fig. 3c we sketch the situation for a 
constant X(w), and with Ur(w) = cos wT/2 it is apparent that 
Zeq(w) = 0. Thus, for a finite-length FSE with an excess bandwidth of 
less than 100 percent, we can conclude that even as the noise becomes 
vanishingly small, the A matrix is nonsingular and there is a unique 
optimum tap setting. 

We digress for a moment to point out that for a finite-length 
synchronous equalizer where 7” = T, (18) indicates that since 
Ur(w + k22/T) = Ur(w), we can conclude that if the folded channel 
spectrum does not vanish completely, then there is always a unique 
tap setting. 


Il. THE TAP-WANDERING PHENOMENON 
3.1 Motivation, background, and infinite-precision considerations 

We have shown that for a finite-length fractionally spaced equalizer 
and the practical range of interest—where the excess bandwidth is on 


the order of 10 to 50 percent—even with vanishingly small noise there 


will always be a unique best tap setting. One issue of interest is the 
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“closeness” (or ill-conditioning) of the A matrix to a singular matrix. 
This is important for two reasons. First, the distribution of the eigen- 
values of A, which is a measure of the ill-conditioning of the matrix, 
influences the rate of convergence of the equalizer taps to their 
optimum setting." Second, and more importantly, we observe, from 
(4) that the contours of equal mse are elliptical, and the eccentricity of 
these contours is directly related to the eigenvalue distribution. In the 
appendix it is shown that for an infinitely long equalizer with T’ = 
T/2, half the eigenvalues are zero; in Fig. 4 we illustrate some constant 
mse contours for a finite-length 7/2 equalizer, whose optimum tap 
setting is denoted by cop. Recall’® that even with an infinite-precision 
analog implementation, the use of a finite step-size in the conventional 
estimated-gradient tap-adjustment algorithm results in a steady-state 
mse that exceeds* E.,. This is depicted in Fig. 4, where the boldface 
contour is the mse that can be attained with the chosen step-size. 
Owing to the random component in the algorithm’s correction term, 
the taps will wander along the constant mse contour, and there will be 
a certain probability that the taps will become so large that one or 
more registers will saturate.’ Thus, even in an analog implementation, 
random tap wandering can, in principle, lead to degraded performance. 


3.2 A model for tap drifting in digital equalizer 


It has been observed in laboratory experiments with a digitally 
implemented FsE, that under control of the conventional estimated- 
gradient tap adjustment algorithm,”® 


Crit = Cn — afenrn], n= 1, 2,3,--- (14) 


the equalizer taps inevitably drift close to the shaded (large tap) region 
of Fig. 4. In (14), ec, is the tap vector at t = nT, a is a positive value 
called the step-size, which influences both the convergence rate of the 
equalizer and the steady-state mse, and the brackets around e,Yrn 
indicate that this increment is quantized to a specified number of bits. 
The term [e,r,] will have a deterministic component proportional to 
the desired gradient, and a random component owing to both the 
manner in which the digital quantization is performed and the influ- 
ence of the noise and data-dependent terms. Generally, [e,r,] will also 
possess a deterministic component owing to bias inevitably present in 
a digital implementation. A typical mechanism for such a bias is the 
two’s complement type of quantizing characteristic shown in Fig. 5. 
To quantify our discussion, we denote the bias by a time-invariant 


* The mse Eopt is achieved when the taps are at their optimum values, cop. 
* For a synchronous equalizer the ellipses will not be very eccentric, and the tap 
wandering will not do any damage. 
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Cy 
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Fig. 4—Contours of equal mse and tap convergence. 








vector, b, and model the adjustment algorithm as 
Cn+1 = Cn — a(en¥n of b), (15) 


where the bias vector has equal components, and b has a magnitude of 
less than half a quantization interval. Equation (15) ignores, except for 
the bias, the effect of limited precision on the algorithm.* Since it has 
been observed in the laboratory that tap wandering results in a 
systematic buildup of some tap values, the model expressed by (15) 
should be useful in relating the magnitude of the bias to the other 
system parameters. It is bias component that can drive, in a determin- 
istic manner,' the tap vector towards the tap region corresponding to 
large tap values. Since the equalizer output is formed as a series of 
partial sums, of the form ¥\, Cmfn—m, it is clear that large-tap values can 
lead either to an overflow of a partial sum or saturation of a tap. As 
the taps grow, occasional register overflows begin to occur, resulting in 
noise-like “hits” on the equalizer output. The occurrence of such a 
“hit” is a function of the specific pattern of data samples contained in 
the equalizer. Continued growth of the taps increases the frequency of 
the “hits” as more data patterns can produce these events. The error 
rate can become very high, relative to what constitutes acceptable 
performance, but the frequency of occurrence of “hits” is still low 


* Reference 10 discusses the effect of limited precision on the mse or an FSE in the 
Bpsence of a bias term. Note that the bias as “seen” by the taps is ab. 
t As opposed to the random wandering associated with the self-noise of the algorithm. 
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INPUT 





Fig. 5—Quantizing characteristic that produces a bias in the tap-adjustment algo- 
rithm. 


enough that the mse is almost unaffected. Growth continues until the 
tap coefficients themselves begin to saturate. At this point severe 
degradation occurs as the degrees of freedom of the system are reduced. 
In fact, with an experimental digitally implemented FSE, with a symbol 
rate of 2400 symbols/second, overflows typically begin to occur within 
several minutes of operation. The overflows produce noise-like hits 
and a degraded equalizer output. 


3.3 The mean tap error and the mean-squared error 


To assess the effects of the bias quantitatively we first define the 
tap-error vector, 


En = Cn — Copt, (16) 


and then use the model described by (15) to write the tap-error 
evolution as 


E€n+1 = €n — a(entn - b) . (17) 


The mse at the nth iteration,* E,,, is given by 


En = Eopt — (€,A€n) . (18) 
Our intent is to study the excess mse, 
Qn = (€n,AEn), (19) 


and the mean tap error vector, (€,), in the presence of the bias, b. 


*In arriving at (18), we assume that iterations are infrequent enough so that 
successive vectors {r,} are independent. In practice, adjustments are generally made at 
the symbol rate, and the algorithm is observed to behave as if the {r,,} were independent. 
This phenomenon is discussed in Ref. 11. 


TAP-LEAKAGE ALGORITHM 1827 


From (17) we have 
(€n41) = (€n) — a(en¥n) — ab = (I — aA)(€n) — ab, (20) 
and thus the steady-state mean tap error satisfies 
{e) =A"b. (21) 


If A; and p;, respectively, denote the ith eigenvalue and eigenvector of 
A, then 
N r 
pib 
(e)= 2 


= Dix 22 
47 ?P (22) 


Clearly, if there is a small eigenvalue whose eigenvector is not orthog- 
onal to b, then the steady-state tap error can be quite large. It is 
interesting to note that for a small number of taps, one would expect 
that the eigenvalues not be small, i.e., the equalizer would not possess 
enough degrees of freedom to realize a somewhat arbitrary transfer 
function beyond the rolloff region. Consider the one-tap equalizer (or 
automatic gain control) where A = (r?(nT)), and consequently 


7 b 
AT) 
Thus, for a one-tap equalizer, the tap error is directly proportional to 
the magnitude of the bias, and the buildup of a large tap value is 
prohibited. However, in the limit as the number of taps becomes 
infinite, it is shown in the appendix that with T’ = T/2, half the 
eigenvalues are zero, while the other half tend to uniformly sample the 
aliased (with respect to the symbol rate) squared magnitude of the 
channel transfer function. Moreover, the eigenvectors corresponding 
to the zero eigenvalues have most of their energy concentrated near 
1/T Hz (and are thus close to being orthogonal to b), while the ith 
eigenvector corresponding to the nonzero eigenvalues approaches a 
sinusoid of radian frequency w; = i/N 7/T. For practical, finite-length 
equalizers, these limiting conditions will only be approximated, and 
there will be small eigenvalues whose corresponding eigenvector is not 
orthogonal to b. Consequently, (€) can become as large as the largest 
ratio [pjb]/A:, and the steady-state tap error would then be biased 
away from the optimum value. 

We now discuss the effect of the bias term on the equalized mse, 


E, = Eopt + (€nAEn), (24) 


(23) 


where the tap error evolves according to (17). In particular we will 


aAvaAmMINnA tha 7m nf + a yY. 
examine tne Sizé Or tne & 


Qn = (€,AEn). (25) 
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An exact analysis (or tight bounds) of the behavior of qg, is an extremely 
difficult problem; however, by assuming that the sequence {r,} is 
independent,”’ and that when the taps are at or near their optimum 
settings, the squared output error is relatively insensitive to the trans- 
mitted data pattern,* it is possible to establish simple, but useful, 
relationships between the relevant system parameters. From (17) we 
have that 


Qnti = (€n+1A€nsi) = (Len — alent’ + b’)JAlen — a(entn + b)]), (26) 
and by using the above assumptions we have (see Ref. 10) 
gnai = [1 — 2ad + a’Am(2N + 1)(r2) lan 

+ a’Am(2N + 1)((rn)Eop + 0"), (27) 


where A,y is the maximum eigenvalue of A, and where 


1 N 
i 2 
ani, 2) 





A= 


is the average eigenvalue. Thus, the steady-state fluctuation about the 
minimum mse is 


adm(2N + 1)[(r?(nT)) Eopt + 67] 


2h — adm(2N + 1)(r(nT)) 2) 


Qa = 

To assess the effect of the bias on g., we note that the bias “seen” 
by a tap component, ab, will be approximately 2-?Cmax, where B is the 
number of bits used to represent the tap weights and Cyax is the 
maximum tap value. If the equalized signal is assumed to have unity 
power, then Cmax ~ 1/[(2N + 1)(ri,)]'” and a is typically” on the 
order of 1/|(2N + 1)-(rz) |. Thus, b’/|r?(nT) | will be on the order of 
2°°3(2N + 1), and when the equalized output signal power is unity, 
Ep is roughly the inverse of the output signal-to-noise ratio. With 
typical parameters like (2N + 1) = 60, B = 12, and Eop: = 0.001, it is 
clear that the effect of the bias on q.. is negligible. Thus, owing to the 
quantizing bias, there can be, on the average, a buildup of one or more 
large-tap weights, while the mse is relatively unaffected. In other 
words, under the influence of a bias the taps would still remain on the 
boldface mse contour of Fig. 4, but would spend most of the time near 
the shaded region, and the system would be subject to random over- 
flows, or hits. This phenomenon has been repeatedly observed exper- 
imentally, and in the next section we will describe a very simple means 
of controlling the tap wandering. 


_* When the taps are at their optimum values, the error is known to be uncorrelated 
with the received samples. 
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IV. THE TAP-LEAKAGE EQUALIZER ADJUSTMENT ALGORITHM 


As we have discussed in the previous section, some or all of the tap 
weights in an FSE can reach unacceptably large values when the 
conventional tap adjustment algorithm, (14), is used. A simple means 
of controlling large-tap buildup is by minimizing either of the aug- 
mented cost functions 


N 
j= E+ iv } c}, (30a) 
i=—N 


i=— 


N 
Jg=E+p ¥ leil, (30b) 
i=—N 

where, as in (3), EF is the mse and p is a suitably chosen (small) 
constant. The cost function J; ascribes a quadratic penalty to the 
magnitude of the tap vector, while J2 provides a magnitude penalty, 
ie., the cost function is penalized whenever the tap vector builds up 
excessively. Since the taps are to be adjusted adaptively, we cannot 
interpret » as a Lagrange multiplier. The use of a Lagrange multiplier 
would be appropriate if we were actually able to minimize J in a 
deterministic manner by using the true gradient. However, since the 
gradient of E with respect to c, Ac — x, is not available, we must 
implement a stochastic algorithm analogous to (14). Thus, » must be 
chosen beforehand by using some prior knowledge of the system 
parameters. 


4.1 Increased steady-state mse: true-gradient algorithm 


As a preliminary calculation, let us first consider the degradation in 
the minimum attainable steady-state mse caused by choosing c to 
minimize J; instead of EF. Note that, for the moment, we are neglecting 
the bias and only assessing the increased mse caused by minimizing 
the augmented cost function, Ji, via the true-gradient algorithm. From 
(4) we observe, for binary transmission, that the taps will attempt to 
minimize the modified criterion: 


Ji =e’ (A + ple — 2c’x + 1. (31) 
c’Be — 2e’x + 1, (32) 


where B = A + pl. The matrix B has the same eigenvectors as A, 
while the eigenvalues of B are A + wp. Note that the contours of equal 
values of J; are still ellipses but the maximum-to-minimum eigenvalue 
ratio governing the tap wandering is now (Amax + [)/(Amin + #4), where 
Amax and Amin are the maximum and minimum eigenvalues of A, 


respectively. Thus, by choosing pw properly, the eccentricity can be 
controlled, and the equalizer tap vector is now determined as if the 


Meee SPEED, Glthi Lilt CssLts av aavur M4 UU 5 46esesTe AN AR EAD 


noise power were increased from o” to o* + p. Of course, the use of a 
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tap vector selected on the basis of the pseudo-noise power, o” + p, will 
increase the steady-state mse.* Note that the steady-state tap vector 
will now satisfy 


Be=x 
or 
ce(u) = Box = (A + pl)’x, (33) 


where c(u) denotes the steady-state tap vector corresponding to the 
chosen value of u. Thus, the minimum attainable mse is given by 


E(u) = e'(w)Ac(u) — 2e"(u)x + I, (34) 
and the increased mse, E(u) — Eopt, is 
E(u) — Eopt = [€(u) — Copt]’A[e() — Copi]. (35) 
To make a more detailed evaluation of the increase in mse, we let 
€(u) = €(u) — Copt (36) 


denote the tap-error vector. Recalling the diagonalization 


N 
A= dX Appi, (37) 


i=— 


and the fact that inverse matrices have the same eigenvectors and 
inverse eigenvalues, we find that 


c(h) — Copp = [(A + pl)7' — Ao ]x 


i ae 
, (5 + x) iad 
ee 
= T+ wD 


€(u) 


ll 


Pix-pi. (38) 


Substituting (38) into (35), and using the orthogonality property of 
distinct eigenvectors, we find that 


N 
E(u) — Bop =u (pix) (39) 


1 
A(Ai + ps)? 
Thus, to a first approximation, the increased mse grows only as the 
square of the leakage parameter, 1, while the eigenvalue distribution— 
and the range in which the taps can wander—can be favorably altered, 
in a significant manner, by using even a very small value of p. 


* The fluctuation about the minimum mse, caused by the finite step size, will also be 
examined. 
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4.2 The adaptive tap-leakage algorithm 


In a manner analogous to the commonly used estimated gradient 
algorithm, the adaptive tap-leakage algorithms are constructed, from 
(30a), by minimizing the augmented instantaneous squared error, 
en + pene, and, from (30b), by minimizing e? + » YX_y |c®|. The first 
algorithm modifies the gradient by a term proportional to the tap 
vector itself, giving the algorithm 


Cn+i = Cn — a[en¥n + pen] 
= (1 — apu)Cn — aep¥n, (40a) 
while the second algorithm is of the form 
Cn+1 = Cn — a(en¥n = uu sgn Cn) 
= Cn — QL SBN Cn — Aen, (40b) 
where the sgn operation is applied individually to each component of 
the tap vector. Note that from an implementation point of view, the 
algorithm can be modified with almost no hardware change other than 
applying a systematic decrement to the magnitude of each tap. The 
second algorithm (40b) has the practical advantage that adjustments 
will continue to be made no matter how small any tap weight becomes, 
while the first algorithm has the “advantage” of analytical tractability. 
Consider now the mean tap error when the leakage algorithm (40a) 
is used. In the presence of digital bias, the algorithm is modeled as 
Cn+1 = Cn a alenYn + b + LCn]. (41) 


Subtracting co»: from both sides of (41), and solving for the steady- 
state average tap error we find 


ll 


(e) = (A+ pl)~"b + (A+ pl)~"pA~*x 
N , N 
pib pix 
(42 
ee ee 





The first term, on the right-hand side of (42), is similar to (22), but 
note that the eigenvalues have been modified to eliminate the cata- 
strophic effects that can accompany vanishingly small eigenvalues. 
The second term, which is proportional to the leakage parameter, is 
similar to (39), and represents the increased tap error caused by the 
minimization of J; and not #. The leakage parameter, », must be 
chosen sufficiently large so that the first term is properly controlled in 
magnitude, but not so large that the magnitude of the second term 
becomes appreciable. In general, the choice of u is best done empiri- 
cally, but if » is chosen to be in the range where Aminimum + PL ~ pt, then 
equating the magnitude of the two terms in (42) gives 
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N 
Y (pib)? 
i=—N 
N (pix)? 
Zoe 


i=—N 


ia (43) 


For any given channel, (43) can be evaluated, but as far as the average 
tap error is concerned, it is sufficient to choose p in such a way that 
the magnitude of (€) is within the nonshaded region of Fig. 4. From 
(42), one reasonable choice is to make p roughly equal to the smallest 
value of A; for which p;x is significantly larger than );. 

To assess the effect of the tap-leakage algorithm, (40a), on the 
steady-state mse we recall eq. (18) 


En a Eopt + Qn, 


where Fopt is the minimum mse when both b and yp are zero. From (41) 
we can compute an approximation to the steady-state value, g., in a 
manner similar to the computation of (28). For the adaptive tap- 
leakage algorithm we find that the fluctuation about the minimum 
mse is 


_ oAm[(2N + 1)-(r?(nT)) Eon + b’b] + 2ayb’x + p’x’A~'x 


44 
2X + w(1 — ad) — afAm(2N + 1){r?(nT)) + 7] ms 


Obviously, (44) is very channel dependent, but when yp is chosen on 
the order of a small eigenvalue, then the fluctuation about the mini- 
mum mse is a rather insensitive function of the leakage parameter, 
while the magnitude of the mean tap error, (42), can be effectively 
controlled by the proper choice of py. 

In the next section we will discuss the results of laboratory experi- 
ments that use the tap-leakage algorithm to control the potentially 
unstable operation of a fractionally spaced equalizer. 


V. LABORATORY EXPERIMENTS 


For an experimental 9.6-kb/s data transmission system using 16- 
point quadrature amplitude modulation, the tendency of the tap 
coefficients of a digitally implemented equalizer with T/2 sample 
spacing to drift is demonstrated by the waveforms of Figs. 6a and 6b. 
These waveforms are analog representations of the values of one 
component of a set of complex tap coefficients associated with data 
samples taken T(=1/2400) second apart. The equalizer is physically 
constructed by using two conventional J-spaced interleaved struc- 
tures, requiring four tap coefficient component distributions to totally 
describe the state of the equalizer. 

The equalizer goes through a conventional start-up procedure, ex- 
cept that timing recovery and carrier phase adjustments are suppressed 
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(a) (b) 


Fig. 6—Tap buildup as a function of time. (a) Three-minute intervals. (b) Five-minute 
intervals. 


so that the distribution of tap coefficients will not change through 
interaction with either of these operations. The transmitted signal is 
fed back to the receiver, after passing through appropriate attenuators, 
thus avoiding any time varying channel characteristics, and assuring 
a low noise environment. 

The first trace of Fig. 6a illustrates the distribution of components 
among the particular collection of coefficients immediately after start- 
up. A single large negative component is noted, with all other compo- 
nents relatively insignificant. The coefficients are updated in the usual 
fashion, via (14), without the addition of a tap-leakage adjustment. 
Subsequent traces in Fig. 6a are taken at 3-minute intervals. The 
traces of Fig. 6b are a continuation of Fig. 6a with the separation in 
time extended to five minutes. A clear pattern of buildup in the 
amplitude of the taps, particularly those immediately preceding the 
original dominant tap, is demonstrated. A similar compensatory 
buildup occurs among those tap coefficient components not displayed, 
such that the mse is essentially unchanged over the duration of the 
test. This deterministic growth of tap amplitudes will eventually lead 
to saturation of shift register accumulators used in forming the various 
components of the equalizer passband outputs. The observed output 
signal constellation will display frequent apparent noise-like hits of 
large amplitude, and an unacceptable output error rate results. 

The tap-coefficient components in the laboratory configuration are 
stored in 24-bit shift registers. The 12 most significant bits are used in 
the multiplication to form tap-product outputs. The remaining bits 
were to average out the effects of tap updating, which is normally done 
according to the rule 


Cn+1 = Cn a a(en¥n) . 


The components of the updating quantity —a(e,r,) are stored in 12-bit 
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words which are added to the 20 most significant bits of the coefficients 
during the normal steady-state mode of operation. To counteract bias 
in the arithmetic, the updating is changed to the tap-leakage algorithm 


Cn+1 = Cn — B sgn Cn — alenYn), (45) 


where 6 = ap of (40b). In the experimental setup, a count of 1 is added 
or subtracted to the 23rd most significant bit of each component of 
each tap coefficient once each symbol interval. In steady-state opera- 
tion, the 12-bit updating signal will typically show activity in a mini- 
mum of the five least significant bits. In general, therefore, the leakage 
term is quite small compared with the conventional updating term. 

The effect of introducing this leakage is shown in the waveforms of 
Fig. 7. The top trace shows the coefficient distribution 40 minutes after 
initial start-up, at the time the leakage is enabled. Subsequent traces 
were taken at 30-second intervals. Within two minutes the coefficient 
components had been virtually restored to the state that existed 
immediately after start-up. 

The experimental arrangement allows the leakage to be scaled over 
a wide range. Viewing the 24-bit coefficient as an integer, the leakage 
increment can be made 2’,r = 0,1, --- , 7 (r = 1 is the case displayed). 
Since the coefficients are chosen to span an analog range of +4, this 
corresponds to £ as defined in (45) ranging from 2~”' to 2”, with B = 
2°*° displayed. The value of a used in the experiment is a = 27". It is 
observed that 6 = 2~”’, the lowest possible level of continuous leakage, 
is adequate to suppress tap drift, indicating the extreme low level of 
the system bias to which the FsE updating algorithm appears suscep- 
tible. 

The larger the value of 6 that is chosen, the more the leakage will 
degrade the equalizer performance, although the degradation is negli- 
gible for B less than 27’. In normal data-set operation, it is observed 


<— TIME 





TAP VALUES ——& 


Fig. 7—The effect of the tap-leakage algorithm (30-second intervals between traces). 
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that a substantially rapid shift in the sampling epoch, which may occur 
during the timing recovery operation, will greatly accelerate the 
buildup of tap coefficients. The choice of B = 27" will allow rapid 
response to this situation without perceivable performance degrada- 
tion. 


VI. CONCLUSIONS 


Effective control of tap drifting for a fractionally spaced equalizer, 
at a 9.6-kb/s data rate, has been demonstrated by employing the easily 
implemented tap-leakage algorithm. The tap-leakage algorithm, or 
some variation of it, might be appropriate for any digitally imple- 
mented adaptive system that has too many degrees of freedom and 
that exhibits coefficient wandering. 


APPENDIX 


Asymptotic Distribution of the Eigenvalues and Eigenvectors for 
Synchronous and Fractionally Spaced Equalizers 


In this appendix we describe the eigenvalues and eigenvectors of 
infinitely long synchronous and fractionally spaced equalizers. 


A.1 Synchronous equalizer 


First we recall the eigenvalues and eigenvectors of an infinitely long 
synchronous equalizer. From (7) we have the eigenvalue equation 


N 
>s Ap-ipi = App, NSkE=N, (46) 
l=—N 

where A is an eigenvalue and p’ = (p-n, -::, Do, -++, Pn) is the 
associated eigenvector. As N — o, taking the Fourier Transform of 
both sides of (46) yields 


T 





A(w)P(w) =AP(w), — fw|[s , (47) 
where 
A(w) = px(o+ ae o 
= | Xeq(w) |? + 0, Jo] <5. (48)* 


The only way for (47) to be satisfied [with P(w) A 0] is for P(w) to be 
concentrated at a single frequency [unless A (w) has the same value at 


* Recall that the Nyquist-equivalent spectrum X.,(w) is defined as X 


¥ X(w + k22/T). 
R 
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more than one frequency]. If we let 


i 


oe 
i= We 
then the solution to (47) is 
Ai = A(wi) 
-NsisN 
P;(w) = 6(w — wi). (49) 


Thus, for a synchronous equalizer the asymptotic (N — ©) eigenvalues 
uniformly sample the folded-channel plus noise spectrum, and the 
eigenvectors are the corresponding sinusoids. 


A.2 Fractionally Spaced Equalizer 


Here the channel-correlation matrix, while symmetric, is not 
Toeplitz; thus Fourier Transform techniques do not yield the eigen- 
values and eigenvectors in the above short order. For convenience we 
consider the noiseless situation, and the eigenvalue equation becomes 


¥ ART’, IT’)p(lT’) = Ap(kT’) -N<k<N, (50) 
l=—N 
where the channel-correlation matrix has elements 


A(kT’, IT’) = ¥ x(mT — kT’)x(mT — IT’). (51) 


With T’ = T/2 and N — o we write (50) for even and odd values 
of k 


i hee T T _T T 
34(k5 03) p (5) +p a(agtg)p (5) 


T 
= Xp ( 3). keven (52) 


Pt T TT T 
At{k—,l— [—)+ Aflk—,l— l— 
on ( 2 5) ( 4 24 ( 2 5) ( ) 


= Ap ( 3). kodd. (53) 


Now (52) and (53) can be written respectively as 

T T 
y} A(RT, IT) p(lT) + ¥\ A (RT, IT) + zP (or + 5) 
rf ZL 


=\p(kT), —-0o<k<o (5A) 
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and 


pa(arss, ir) p(t +ya(ares, +s) (5) 
2 2 2 2 


= Ap (er + 5) , (55) 


where both equations hold for all integer values of k, and, more 
importantly, the various component matrices are now all Toeplitz.” If 
we let 


Ka(v) & X(e) -X(0- 2) -x(0 + Z), Jol sz (56) 


and 
Pera Ple =P list Jo|<s—, (57) 
= T i i ae a 
then taking the synchronous Fourier Transform (i.e., with respect to 


the JT seconds sampling interval) of (54) and (55) gives 


| Xeq(w) |?P(w) + Xeq(w)X %4(w) P(w) = AP (w) 0<|w/<s (58) 


T 
T 
and 

Xeq(w)X &(w)P(w) + | Xeq(w) |?P(w) = AP (w). (59) 
Note that p(kT + T/2) has the Fourier Transform 

Tt 
e 2p (w), 

while the Transform of p(kT) is of course P(w). Arguing as we did for 
the synchronous equalizer, we see that the ith eigenvectors P;(w) and - 
P;(w) must again be delta functions at w; = i/N 7/T. Upon setting the 
determinant of the pair (58) and (59) to zero, we see that the eigen- 
values satisfy 


A? — Ai|| Xeqlwi) |? + | Xeq(wi) || = 0, (60) 
and thus for each value of w; there are two eigenvalues 


i=0 
2 


Ai= | Xeq(wi) |? + | Xeq(wi) ‘fe = 2 [x(« + om) (61) 


T 





In contrast to the synchronous equalizer, half of the eigenvalues are 


* For example, A(kT, [T + T/2) = ¥, x(mT — kT) x(mT — IT + T/2) = Yn x(nT)x|nT 
+(k-1)T + T/2|. 
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exactly zero, while the other half are samples of the aliased magnitude- 
squared channel transfer function. Not surprisingly, the eigenvalues 
are independent of the receiver sampling phase. Once the eigenvalues 
are determined we can solve for the eigenvectors. The ith eigenvector 
associated with the zero eigenvalue is 


X* (wi) e2°"7, n even 
Di (x 4 = al ) slot) (62) 
2 —Xt (wie \ ? n odd, 
while the eigenvector associated with the nonzero eigenvalue is 
7 Xeq(wi) ei", n even 
t n— = ~ fw,{ 1 4 (63) 
( ) Keq(wi)e” caer 


At this point we remark that when «; is not in the rolloff region then 
Xeq(Wi) = Xeq(wi), and (63) describes ‘a sinusoid of frequency w;, since 
the even and odd portions of p;(n T/2) mesh together in a continuous 
manner [i.e., 


piln T/2) = Xeq(wide””™”]. 


However, (62) describes a function that changes sign and oscillates 
almost a full cycle in T seconds. Consequently, p;(n T/2), as given by 
(62), will have most of its spectral energy concentrated near 1/T Hz. 
When w; is in the rolloff region, the frequency content of (62) and (63) 
will differ somewhat from the above extreme cases but the general 
results will still be as above. 
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Using Magnetic Bubble Memories to Provide 
Recorded Announcements 


By J. E. ROWLEY and J. BERNARDINI 


(Manuscript received September 20, 1981) 


Recorded announcement systems have traditionally been electro- 
mechanical in nature, and have required routine maintenance and 
adjustments. An electronic system, the 13A Announcement System, 
was designed to replace many of these older systems and eliminate 
the need for routine maintenance. It is the Bell System’s first appli- 
cation of magnetic bubble memories, and can provide from one to 
eight announcements of up to 24 seconds in length each. Entering the 
fifth year of production, the 13A is being installed in almost every 
type of central office of the telecommunications network. 


|. INTRODUCTION 


In the Bell System, audio recorders and announcement systems are 
used for a variety of applications to provide customers and operators 
with information. One group of applications are those associated with 
calls that cannot be completed. Announcements in this group tell the 
customer why the call was not completed. Examples of problems 
explained by these announcements are: non-working numbers, incor- 
rectly dialed calls, damaged equipment and heavy traffic conditions. 
In general the announcements associated with these conditions are 
non-revenue producing. These announcements are fairly brief, many 
lasting only twelve seconds. Another major group of announcements 
are those which are customer requested. These announcements are 
longer, often about a minute in length. Examples of these are: the 
weather, stock market quotations, the news, department store sales, 
jokes, sports phone, etc. In general this group of announcements are 
revenue producing. 

Traditionally, the machines used to provide these announcements 
have been categorized as being either light-duty or heavy-duty ma- 
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chines. The light-duty machines were generally designed for customer 
premises or for central offices where their use was not continuous. The 
heavy-duty service machines were characterized by a much higher 
calling rate, requiring that the machines run continuously 24 hours a 
day. 

The majority of the recording machines presently being used by the 
Bell System are approximately 25 years old in design. These recorders 
are all basically electromechanical in technology, employing magnetic 
drums for analog storage. In addition these designs require magnetic 
heads, mechanical drives, and a variety of moving parts. The problem 
associated with these designs has been the need for frequent routine 
maintenance in the form of lubrication, mechanical adjustments, and 
the replacement of worn parts. There is also a high cost associated 
with the special mechanical skills required for this type of maintenance. 
Other drawbacks of these electromechanical systems include their 
large physical size and their large power requirements. 

In contrast a solid-state design employing digital storage has the 
advantage of no moving parts and, hence, no routine maintenance. It 
also has the advantage of high reliability, extremely long life and small 
size. This technology also leads itself to modular construction with 
standard components and circuit boards. 

A study conducted in 1975 showed that for the shorter announce- 
ments a memory system employing magnetic bubble technology would 
be economically attractive. The Bell System had extensive develop- 
ment under way in this technology. It was clear that the magnetic 
bubble memory architecture was ideal for use in a solid-state an- 
nouncement system and that the nonvolatility which the bubble mem- 
ory offered was very desirable for this application. The large storage 
capacity of the bubble memory was another important consideration. 
All of these factors combined to make magnetic bubble memories the 
ideal choice for a storage medium. 

In the attempt to define what type of announcement system would 
be appropriate for the present and near future market, a survey of 
operating companies was conducted in 1975. This survey showed that 
90 percent of the shorter announcement market could be served by an 
eight-channel system. It also indicated that 85 percent of the market 
would be served by a system capable of providing recordings up to 24 
seconds in length. To minimize digital storage requirements various 
digital coding techniques were considered. Based on human factors 
tests, adaptive delta modulation (ADM) at 24 kb/s was selected as the 
best compromise of bit rate and intelligibility. 

With message length, bit rate, and multiple channel capability 


f WA _atate an incement 
defined, the basic design requirements of a solid-state announcement 


system, coded the 13A, were established. 
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ll. GENERAL DESCRIPTION OF THE 13A ANNOUNCEMENT SYSTEM 


The design of the 13A system evolved as a multichannel system that 
can record and play back up to eight messages. Each message can be, 
and usually is, different. With appropriate distribution circuits, each 
message can serve up to 500 customers simultaneously. For many 
applications, the 13A is used as a direct replacement for up to eight 7A 
electromechanical machines. Figure 1 shows the 13A as compared to 
eight commonly used 7A machines in standard central office frames. 
Figure 2 shows the 13A fully equipped with eight message modules. 

To minimize the cost of the system, circuitry that is common to all 
messages resides on three circuit packs at the left side of the system. 
A minimum 13A system requires one additional message-module cir- 
cuit pack, which contains the memory and other circuitry necessary 
for that particular message. The remaining seven positions at the right 
side of the 13A can be equipped or not, as desired. 

There are two different message module circuit packs. Both types 
offer a variable-length message feature. One records announcements 
from 3 to 12 seconds in length, and the other can record messages up 
to 24 seconds long. The message length can be adjusted in 3-second 
increments by means of a thumb-wheel switch on the message module 
pack. Each pack can be set to a different message length, if desired, 
contingent upon its associated switching system. 

The 13A requires a maximum of 77 watts from the -48 volt central 
office battery for eight channels. This is an improvement over previous 
announcement machines, such as the 7A, which requires 30 watts from 
110 volts ac for one channel. 

Each message channel of the 13A has a START input which controls 
the start of a message. After a closure to ground on this input, the 
message starts at the beginning with an average waiting time of 2.5 
seconds. As long as the start input is grounded, the message will 
continue to repeat with a 3-second silence interval between message 
repetitions. 

In addition to the message audio, several other outputs are available 
on a per-channel basis. To provide interchangeability between the 7A 
and the 13A, 7A output signals and their nomenclature were adopted. 
These outputs are contact closures that provide timing information 
that can be used by trunk circuits to ensure that the customer hears 
the message from the beginning, rather than from any point within 
the message. 

Each message-module channel also provides a voice-alarm output. 
A contact closure on this pair of leads signals an absence of audio 
output from the message channel. A loss of power to the 138A system 
will also result in a voice alarm. A system input called VATEST can 
be used to test the operation of the voice-alarm circuit. 
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Fig. 1—Frame-mounted 13A Announcement System (left) compared to eight 7A 
units, 


There are two methods for recording a message on the 13A. One 
method is to use a G3CR-type handset. The preferred method is to 
use a prerecorded tape and dub the message from a cassette recorder 
(typically) onto the 138A. The latter method produces the highest- 
quality announcement, since the tape can be prepared in a quiet 
environment by a professional speaker. The 138A provides a tape input 


1844 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1982 


ame Cnn RECHEN I. Ob SONES Oet SoU Cente 
1 SHER MEE HD EEN WOE 8, 8 
ah a re a 3 
womnae ats, 


Eee diedeh em becbusantudardant 
_ * oe eee eae pens a : - 
soon deatorheemnnobere te he laenmebemsamenhonate’ 





Fig. 2—The 13A Announcement System with eight message modules. 


jack for this purpose. With the introduction of special information 
tones on standard intercept, vacant code, no circuit, and reorder 
announcements in 1981, dubbing from prerecorded tapes will become 
the standard method for recording these messages. 

A simplified block diagram of the 13A system is shown in Fig. 3.’ In 
recording, the input speech is filtered, converted into a digital repre- 
sentation using adaptive delta modulation (ADM), and written into the 
29A magnetic bubble memory. Reproducing the message involves 
recovering the data from the memory, conversion back to an analog 
signal, and amplification for driving the output, which interfaces with 
trunk circuits. A large portion of the 13A circuitry is used to generate 
signals that are peculiar to the 29A memory. Therefore, some knowl- 
edge of the bubble memory is helpful in understanding the 13A system. 


lll. THE 294 MAGNETIC BUBBLE MEMORY 


The 29A package, shown in Fig. 4, is a 32-pin dual in-line package 
2.49 inches long and 1.2 inches wide. A permalloy outer case protects 
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Fig. 3—Block diagram of the 138A Announcement System. 
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Fig. 4—The 29A bubble memory. 


the bubble chips, rotating field coils, erase coil, and bias magnet from 
stray external fields.” 

The package contains four 68,121-bit chips for a total of 272,484 bits. 
At the 24-kb/s encoding rate, storage capacity is approximately 3 and 
12 seconds for chip and package, respectively. Each chip, approxi- 
mately 5 mm by 6 mm, is organized as a single serial shift register. 
Ones and zeros are represented by the presence or absence of bubbles, 
respectively. Generation of new information and detection of the 
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Fig. 5—The 29A chip organization. 


existing memory contents are both done outside the 68,121-bit shift 
register loop. 

A line diagram of the chip organization is shown in Fig. 5. A bubble 
is generated at point G by an appropriate current pulse in the G 
conductor. Eight steps later, the bubble is at the write transfer point, 
W. If no current pulse is present in the R conductor, the bubble will 
travel along the path which ends in an X and be annihilated. If the 
proper current pulse is present in the R conductor, the bubble will 
take the other path, which will put the bubble into the data loop at 
point M. This is a passive merge point, and no signals are required to 
merge the generated bubble into the 68,121-bit loop. 

Once in the main loop, the bubbles require no conductor signals to 
keep them circulating in the loop. However, no bubbles will go to the 
detector unless a proper current pulse is applied to the R conductor. 
Two different types of pulses can accomplish this. A transfer pulse, 
applied to the R conductor, will cause all bubbles at point R to travel 
to the detector. A series of 68,121 of these could be used to clear the 
memory. Or, in combination with the generator lead, a smaller number 
of transfer pulses could be used to rewrite a portion of the 68,121 
locations. Another pulse, a replicate pulse, will cause a bubble at point 
R to split into two. One bubble stays in the main loop, the other travels 
to the detector. This pulse type is used for nondestructive readout of 
the memory’s contents. The transfer pulse is not used in the 138A, since 
the 138A does not write a portion of a chip. In the 13A, clearing of the 
29A memory is done by using the z coil. 

In the z axis, which is perpendicular to the plane of the chip, there 
is a bubble-stabilizing bias field, nominally 160 Oe, which is supplied 
by a pair of permanent magnets inside the package. This field can be 
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aided or opposed by a current in the z coil. The z coil is used in 
manufacture to test the bias margins of the four chips. A much larger 
z coil current, aiding the bias field, will cause collapse of the bubble 
domains and can be used to bulk erase the entire memory. 

In a semiconductor shift register, a clock signal advances the mem- 
ory. In the 29A, an in-plane, constant-magnitude, rotating magnetic 
field does this. One 360-degree revolution of the rotating field advances 
the bubbles by one position. The rotating field is formed by applying 
two sinusoidal currents, 90 degrees out of phase, to the inner and outer 
field coils. In the 138A, these coils are driven by voltage sources, so the 
inductance of the coils must be tightly controlled in order to obtain 
control of the current amplitudes. At the 48-kHz rotating field fre- 
quency, the required inner coil drive voltage is 25.15 volts peak to 
peak, and the current is 0.50 ampere peak to peak. The outer coil 
requires 25.15 volts peak to peak at 1.34 amperes peak to peak. The 
tolerance on both voltages is +5 percent. Because the Q of the two 
coils is different, the desired 90-degree phase shift in currents is 
achieved by having the outer coil voltage lead the inner coil voltage by 
95 + 3 degrees. 

The timing of the other functions of generate, replicate, and strobe 
(for data detection) is related to the phase angle of the rotating field. 
The timing requirements, as well as the current amplitudes, are shown 
in Fig. 6. It should be noted that the function drives must be current 
sources, since the function resistances have a tolerance of +20 percent 
and the current tolerance, for example, of high replicate current is 110 
mA + 13.6 percent. 

In the 29A package, the generators of all four chips are connected in 
series, and one pair of generator leads is brought out. Similarly, the 
detectors and dummy detectors are connected in series. The replicate 
leads are brought out individually, and they function as chip-select 
leads, determining which chip is being written into or read from. 

The bubble arrives at the detector 60 rotating field cycles after being 
replicated at point R. The bubble field is detected by a magnetoresis- 
tive detector. Since other fields (especially the rotating field) are 
picked up, a similar dummy detector, under which no bubbles pass, is 
included to cancel out unwanted signals. Both active and dummy 
detectors are driven by constant current sources of 4.5 mA + 0.2 mA. 
The match between the two current sources must be better than 1 
percent. One side of the active and dummy detectors is connected 
together inside the 29A, and external circuitry must place this node at 
ac ground. The remaining two leads, one active and one dummy, must 
be amplified differentially. Since the detectors are insensitive to field 
polarity, the unwanied common mode signal is mostly at twice the 
rotating field rate and is about 80 mV peak to peak. 
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The desired bubble signature occurs during the strobe interval and 
is seen differentially across the active and dummy leads. To obtain a 
digital result, the signal is processed as shown in Fig. 7. A “1” (bubble) 
signature is typically 5 mV in magnitude, and a “0” response is 
somewhat smaller, as shown in Fig. 8. A technique known as dc 
restoration is used after this waveform is obtained. This references the 
signal level to the level at the start of the strobe interval. That is, 
within the strobe interval: 
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Fig. 6—The 29A timing requirements. 
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Fig. 8—Amplified bubble signal before dc restoration. 
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where V(0) is the signal amplitude at the start of the strobe interval. 
Since V(0) is higher for 0’s than for 1’s, this process increases the 
separation between 0’s and 1’s, as shown in Fig. 9. 

The specifications of 29A outputs are based on dc restored wave- 
forms. Output amplitudes vary widely from module to module. In the 
13A, these variations are dealt with by varying the amplitude of a 
decision threshold until it is one-third of the distance between the 0 
and 1 amplitudes. 

One advantage of bubble memories is that data can be retained 
during loss of power, provided the rotating field signals are turned off 
properly. The rotating field must turn on and off within the +45- 
degree segment as shown in Fig. 10. Failure to do this will result in loss 
of memory contents. 

The 29A memories undergo extensive testing during manufacture.° 
This inciudes test of worst case amplitude and timing margins at room 
temperature, as well as a +3 Oe bias margin test. In addition, each 
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device must pass a less severe test at both 13°C and 60°C case 
temperature. 


IV. THE AUDIO ENCODING-DECODING CHAIN 

The 13A, viewed as a digitized audio system, is shown in Fig. 11. 
The signal flow can be traced through the audio-processing stage, the 
audio-encoding section, and the bubble memory. After the memory, 
the signal goes to a decoding circuit and then to an output amplifier. 
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Fig. 11—Audio-chain and error-detection circuitry. 


The first stage is a balanced amplifier which provides an input for 
a T1-type transmitter or a tape recorder source. The input of this stage 
is designed to bias a T1-type transmitter for up to 850 ohms of loop 
resistance and provide ac coupling for a tape recorder input. 

The aGc amplifier provides 40 dB of input dynamic range to allow 
for various speaker levels, record levels, and loop resistance losses. 
With the input dynamic range compressed to 4 dB, an equalizer 
provides a signal increase at a slope of 6 dB/octave above 1 kHz and 
a signal decrease at 6 dB/octave below 1 kHz. This preemphasis is 
coupled with a corresponding deemphasis at the decoder output so 
that a flat frequency response is maintained. The overall preemphasis/ 
deemphasis was subjectively determined to improve the response of 
the encoder/decoder chain by eliminating quantization noise at the 
decoder output. The last analog stage before encoding is a third-order, 
2.8-kHz, low-pass filter defining the 13A bandwidth. 

The filtered audio signal is digitized using an adaptive delta modu- 
lator (ADM) encoder operated at 24 kb/s. This is an ADM chip-set 
originally developed for the SLC*-40 system and operated at approx- 
imately 38 kb/s.* For the 138A, the reduced clock rate is made possible 
because of the input acc and the reduced importance of speaker 
identification. The output of the ADM encoder is processed by a 
preamble pattern inhibiter. The purpose of this circuitry is to inhibit 
bit patterns of 15 or more 1’s and patterns of 15 or more 0’s. A pattern 
of 15 or more 1’s is modified by changing the eighth 1 to a 0. A pattern 
of 15 or more 0’s is modified by changing the eighth 0 to a 1. This 
procedure ensures a unique bit pattern for use as a data preamble. 
With the apm encoder used in the 138A, patterns of greater than 14 1’s 
or 14 0’s have a low frequency of occurrence. This procedure, therefore, 
introduces a negligible distortion into the ADM encoding. The preamble 
is generated by inserting a bit sequence of 15 1’s surrounded by 0’s as 
the first data loaded into the bubble memory during a message record- 
ing. The bubble memory output is converted to logic levels by the 
detector amplifier. The detector drives both an apm decoder and 
circuitry associated with preamble detection, synchronization and 
error detection (see Section VIII). The Apm decoder decodes the 
digitized speech into audio. The inverse equalizer normalizes the 
speech to a flat spectrum and the low-pass filter eliminates quantizing 
noise residing outside of the audio band. 

The final stage in this chain is a low-output-impedance amplifier 
capable of driving up to 500 trunk circuits at a maximum level of -9 
volume units. Referring to Fig. 11 it should be noted that all circuitry 


* SLC is a trademark of Western Electric. 


MAGNETIC BUBBLE MEMORIES’ 1853 


after the preamble generator is duplicated for each message channel. 
All circuitry before this point is shared by all message channels for 
both record and playback operations. 


V. TIMING AND FIELD DRIVE 


A major consideration in the use of bubble memories is the overhead 
represented by the necessary circuitry for control and timing-signal 
generation. In the 13A, this circuitry takes the better part of two 
printed-circuit boards. However, this circuitry is shared by all message 
modules in the system. 

Figure 12 shows a functional diagram of the circuitry for the field 
drive, field-cycle timing, and chip-cycle timing signals. The field drive 
used in the 13A is sinusoidal. The basic system timing is derived from 
the crystal oscillator which drives the control and timing counter. The 
control counter in turn supplies a clock to the quadrature sinewave 
generator, chip-timing counter and the timing Read Only Memory 
(ROM). The start and stop timing for the quadrature sinewave generator 
and for the application of the generator outputs to the power amplifiers 
is provided by the timing ROM. The timing ROM also provides basic 
timing for the field-cycle timing signals previously described. The chip- 
timing counter provides signals necessary for timing during the chip 
cycle. 

On the message-module boards, where the bubble memories reside, 
are capacitors in parallel with both the inner and outer coils. These 
form parallel resonant circuits at the 48-kHz rotating field frequency. 
They reduce the power required for steady-state operation by a factor 
of 10, but they also change the start-up and shutdown loads that the 
power amplifiers drive. 

To maintain nonvolatility with bubble memories it is important that 
the power up and power down cycles occur within the +45-degree 
quadrant of the field drive cycles. When a system is powered up, the 
timing ROM starts the quadrature sinewave generator, allowing enough 
time for the generator to stabilize; then the timing Rom provides the 
start signal for the closure of the analog switches. The analog switch 
associated with the outer-coil drive is closed first. After a delay 
corresponding to 95-degrees of the field drive cycle, the inner coil drive 
analog switch is closed. At this point the inner and outer coil field drive 
signals must meet the previously outlined amplitude and phase re- 
quirements. 

For nonvolatility upon power down, the inner and outer coil drive 
analog switches are shut off simultaneously at the point in the field 
drive cycle where the outer coil current is a maximum and the inner 
coil current is a minimum. This resuits in a gradual decay of the outer 
coil current and a slight oscillatory decay for the inner coil current. 
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Fig. 12—Field and chip timing circuitry. 


The resultant field vector lies within the +45-degree sector necessary 
to maintain nonvolatility. 

The final links in the chain for the inner and outer coil field drives 
are the associated power amplifiers. To maintain nonvolatile bubble 
memories during steady-state, transient, and variable-loading condi- 
tions, these amplifiers must be carefully designed. During steady-state 
operation, the amplifier must supply a sinusoidal signal whose ampli- 
tude must stay within +5 percent and whose phase must not vary by 
more than +1 degree for a load variation of 1 to 16 field coils. When 
the field drive signals are first turned on, the field drive loads appear 
capacitive. For this case, the power amplifiers must be capable of 
supplying up to 16 amperes at the sinusoidal peak voltage (12.57 volts). 
On turn-off, the power amplifiers must exhibit the transient response 
and low output impedances necessary to dampen the field coil energy. 
The power amplifiers also include circuitry to inhibit extraneous 
signals from reaching the field coils when the power supply voltages 
are below 13 volts. This is necessary to deal with transients occurring 
during low-voltage conditions when the 13A system is powered up or 
down. 


VI. POWER CONSIDERATIONS 


To ensure a nonvolatile system, the 13A must sense loss of power 
and turn the rotating fields and timing signals off before power supply 
voltages decay to the point that the control logic will no longer 
function. On power up, the system must hold the fields and timing 
signals off until power supply voltages are within normal limits. Figure 
12 shows the functional circuitry and signals related to the 13A power 
start-up and shutdown operations. 

When the dc voltages from the power supply are within 5 percent of 
their nominal value, the power sense signal is sent to the control logic. 
The power sense signal initializes all the logic and starts the simewave 
generator via the timing Rom. Allowing sufficient time for the sinewave 
generator to stabilize, the field drive and field cycle timing signals are 
switched on. 

If the -48 volt power is lost, or the power switched off, the power 
sense signal is switched low when the dc voltages have decayed by 5 
percent. This signal, delayed by the control logic, switches off the field 
drive signals and associated field cycle timing signals at the proper 
time in the rotating field cycle. The maximum delay is one cycle of the 
rotating field, or 21 us. 

Although the previously described power-down sequence maintains 
13A nonvolatility, it causes the messages to be stopped at a random 
point in the chip cycle. Message boards powered down in this manner 
cannot be interchanged between different 13A’s without introducing 
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preambles at different relative positions. The ambiguous preambles 
would not permit proper synchronization of the 13A system. This, of 
course, would not be a problem for a single-channel 13A. To permit 
interchangeability of message modules for multichannel 13A’s, the 138A 
can also be powered down by first employing the Field Down switch. 
Operating this switch (before switching the power off) turns the field 
drive signals off near the beginning of the chip cycle. This ensures that 
all message modules fielded down in this manner will have preambles 
in the same relative position. This feature cannot be ensured for 
emergency power down, or loss-of-power cases, because the control 
timing would have to delay the field down switching for up to one chip 
cycle (2.84 seconds) to reach the proper point in the chip cycle. 


Vil. RECORD AND PLAYBACK CONTROL 


The record and playback operation of the 13A is controlled locally 
via circuit-card edge switches and light-emitting diodes (LEDs). The 
audio source can be either a telephone handset or a tape recording. 
Message modules can also be prerecorded at one site and installed in 
systems at other sites, provided the proper precautions are taken using 
the Field Down switch. 

The record operation starts with the record-request signal generated 
from a card-edge push-button switch. This sets the record-request flip 
flop, which in turn lights a record-request LED on circuit pack UD3’s 
card-edge. When the chip counter arrives at the appropriate point of 
the chip cycle, the record process is started by pulsing the z coil of all 
accessed message modules to clear the data recorded previously. Next, 
the preamble pattern is generated. The record-request LED is turned 
off and the record LED turned on as a signal to input the audio. At the 
end of each chip, the generated data is switched to the next chip until 
the last-chip signal occurs. The last-chip signal turns off the record 
LED, which signals the end of the record interval. 

The playback operation can be controlled either locally or remotely. 
With the selected channel started remotely or accessed locally, the 
playback operation begins. This involves replication and detection of 
bubble signals, decoding the ADM data, and amplification for driving 
trunk circuits. At the end of each chip cycle, the replication process is 
switched to the next chip in the bubble module. If the playback 
involves two bubble memories, switching between them is not imme- 
diate. Owing to the delay between the replicator and the detector 
inside the 29A, switching of the detector from the first to the second 
module must follow switching of replication by 30 data cycles. 

The setting of the thumb-wheel message-length switch determines 
which chip is the last chip in the message. At the end of the last-chip 
signal, the message ends and, if the start or access signals are present, 
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a 2.84-second silence interval begins, after which the message repeats. 
During the silence interval, several relay signals are given to signal the 
end of the message and the start of the next message repetition. 


Vill. PREAMBLE SYNCHRONIZATION AND ERROR DETECTION 


In the 13A system, the stored-message-synchronization and error- 
detection techniques were combined into common logic. The design 
requirements which prompted this approach were based on a need to 
provide synchronization of stored messages to the control counter and 
inexpensive means of logic fault detection. It was considered important 
that the fault-detection scheme be capable of detecting stuck-at- 
one(sAl1) and stuck-at-zero(sa0) conditions in the data paths, and at 
the same time tolerate small or transient errors in the data. It was also 
considered desirable that the system be able to detect and recover 
from conditions when the control counter was out of synchronization 
with the messages. 

Referring to Fig. 11, the data stream of the ADM encoded speech is 
first processed to inhibit preamble patterns and their complement 
from the data stream. A preamble of 15 1’s is then first generated and 
stored in the bubble memory. When reading the data from the bubble 
memory, the data is first applied to the preamble detector noninverted. 
The output of the preamble detector, the match signal, is used to time 
synchronize the chip timing counter. Normally the first preamble 
match signal that occurs after powering up is employed to provide this 
synchronization. Following this operation, all 13A messages are chip- 
cycle synchronized to the control logic. All further occurrences of the 
match signal are compared with the chip-timing counter signal, TR. 
With a normal 13A system, match signals occur timed to TR. In this 
normal mode, every 60 seconds (enough time for two cycles of the 
longest 13A message) the data from the bubble store is inverted and 
applied to the preamble detector logic. During this cycle, the preamble 
detector in effect searches for 15 0’s. Since this pattern was inhibited 
from the memory, any match signal generated in this cycle is inter- 
preted as an error condition. 

To make the system tolerant to transient errors, a threshold of two 
or more false preambles during a 60-second interval was established. 
If two or more false preamble errors are detected, the internal digital 
voice alarm (DVA) is set. To cover the possibility that the control 
counter has slipped out of sync with the stored messages, a sync retry 
cycle is initiated at this point. It starts by permitting the next preamble 
match signal to resync the chip timing counter. If another two or more 
false preambles occur, the external voice alarm test (VAT) signal is set. 
This same alarm is sei for any case of a faise preambie match signal 
during the data complement cycle. When an alarm is set during the 
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noncomplement data cycle, repeated attempts to resync the control 
counter to the preamble match signals are made. If a resync condition 
is achieved the alarm is reset and the system is returned to a normal 
state. 


IX. PHYSICAL DESIGN AND CIRCUIT PACK DESCRIPTION 


The five types of circuit packs used in the 13A are 8 by 11 inches. 
Except for circuit packs UD1 and UD2, all have three 963B-20 con- 
nectors for a total of 60 1/0 pins per pack. UD1 and UD2 need only 
two connectors each. All packs are class 1 (25-mil paths and spaces) 
double-sided printed-circuit boards. All controls and indicators for 
operation of the 13A are mounted on the front edge of the circuit 
packs. 

A major objective in the physical design of the 13A system was to 
eliminate hand wiring. This is accomplished by the backplane, shown 
in Fig. 13. This double-sided printed-wiring board interconnects all the 
circuit packs and the packs to the 123 1/o pins. Screw terminals are 
provided for -48V power and ground return. All other 1/o terminals 
are 0.045-inch-square wire-wrap pins. Only four wires are needed to 
connect the handset and tape jacks located on the mounting strip at 
the front of the system. 

Overall dimensions of the 13A system are 9.88 inches high by 12 
inches deep by 23 inches wide. Compatibility with all the types of 
offices and frames in which the 138A is used is achieved by ordering 
different brackets to mount the system in the various frames. On the 
front of the unit there is a smoked plastic door, hinged at the bottom. 
All system indicators can be seen through the closed door at either the 
top or bottom. The strip across the center of the door is opaque. When 








Fig. 13—The 13A Announcement System backplane. 
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the door is opened, the instructions on the inside of the door are 
visible. These instructions, shown in Fig. 2, contain a summary of the 
procedures for monitoring an existing message, recording a new mes- 
sage, and shutdown of power. All system indicators (LEDs) and control 
switches are also identified here. 

The first circuit pack in the 13A is the power supply, which is coded 
UDL. It is a pulse-width-modulated dc-to-dc converter. It converts the 
-48 volt supply to +15.75V at 1.6A, -15.75V at 0.5A, and +5.2V at 3.2A. 
Also, two low-current supplies, -4V at 100 mA and -8V at 100 mA, are 
derived from the -15.75V. When supplying these currents, which are a 
maximum for a fully equipped 13A, UD1 draws 1.6A from the -48V 
supply. The voltage outputs have a tolerance of +5 percent, except for 
the +5V supply. It has a tolerance of +3.5 percent. Circuitry is also 
provided for electronic shutdown if the voltages exceed their tolerance 
or if currents exceed specified values. Each message board adds a 
resistor in parallel with the current-sense resistor on UD1, changing 
the current-sense limit in proportion to the number of message boards 
in the system. Also, the +5.2V and +15.75V outputs are monitored, 
and both must be higher than the minimum values to result in a high 
level on the PWR SENSE output. 

The second common circuit pack is UD2, the encoder driver. It 
provides the circuitry for the input speech processing and encoding. 
This circuit pack also generates, amplifies, and provides analog switch- 
ing of the field drive signals. 

The other common circuit pack is UD3. This circuit pack contains 
the basic digital control and timing for the 13A. It controls the rotating 
field generation, provides bubble memory timing signals and chip 
sequencing, and placement of the recording of new messages. UD3 also 
provides the logic for message synchronization and error detection. 

UD4 is a variable-length message module that stores and plays back 
up to 24 seconds of speech. It has a variable message length (VML) that 
can be adjusted from 3 to 24 seconds in 3-second increments. UD4 
stores the digitally encoded speech in two 29A bubble memories. It 
converts the stored information back into analog speech, and it ampli- 
fies and buffers the output to drive up to 500 trunk circuits. Signaling 
closures are also provided by this circuit pack. 

UD6 is a variable-length message module that stores and plays back 
up to 12 seconds of speech. It is similar to UD4. In fact, the two codes 
use the same printed-circuit board. A UD4 has the second 29A and 
associated components, a UD6 does not. 


X. MANUFACTURING AND APPLICATION STATUS 
The 183A is manufactured at the Wesiern Hiectric Columbus Works. 
Manufacturing information was released on October 1, 1977. The first 
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13A was shipped to New York Telephone Company in February 1978. 
By the end of 1981, approximately 4188 13As had been shipped, and 
production should continue at the rate of about 110 systems per month. 
This requires 1670 29A memories per month. Each circuit pack must 
pass a circuit-pack test. Testing of UD38, UD4, and UD6 packs is 
automated, using computer-controlled test sets. The only adjustments 
during circuit-pack test are made on the message-module packs to 
match the detection threshold with the particular 29A device. In 
addition to circuit-pack tests, each 13A is given a system test before 
shipment. 

The first 60 systems manufactured went to a special application in 
New York City. This is the Automatic Dial Coin Zone (apcz) System, 
which uses announcements to automate handling of two-message-unit 
coin phone calls in the New York City area. Each 13A is equipped 
with six UD6 message-module packs, for a total of 360 29A’s in the 60 
systems. All systems were placed in service on July 1, 1978. 

At present 13A systems are also in service in Step-by-Step, No. 1 
and No. 5 Crossbar, No. 1/1A ESS, No. 2/2B ESS, and No. 3 ESS 
offices. In No. 3 ESS applications, the 13A is installed and shipped 
with the office from the manufacturing facility. For No. 1/1A and No. 
2/2B ESS the 13A’s are installed and shipped as part of an announce- 
ment frame. In all other applications the 13A’s are shipped directly to 
central offices for installation. 


Xl. CONCLUSION 


The 13A is the first application of magnetic bubble memories in the 
Bell System. The 13A performance and modular design has permitted 
it to satisfy a very large percentage of the Bell System announcement 
market. It is being used in all types of central offices, and will be used 
in No. 5 ESS. 

The 13A compares very favorably to the widely used electrome- 
chanical 7A system. The 138A is lower in cost for systems employing 
three or more channels, requires 77 watts for an eight-channel system 
versus 30 watts for a single 7A, and eliminates the need for costly 
periodic maintenance. 

Magnetic bubble memory technology has proven to be an excellent 
match to the needs of the 13A Announcement System. By using 
magnetic bubble memories as the storage medium, the 13A offers 
nonvolatile storage of recorded announcements. The high storage 
density of magnetic bubble memories has enabled the 13A to occupy 
as little as one-eighth the space previously taken by eight 7As. The 
high density and low cost of magnetic bubble memories relative to 
other nonvolatile and writable storage devices enable the 13A to be 
very cost competitive with electromechanical systems. 
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In its fifth year of production and at its current manufacturing rate 
of 110 per month, the 13A has proven to be a successful product. This 
is true both in terms of the use of magnetic bubble memories and the 
13A modular design. 
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a Nonlinear Optimal Assignment Problem 
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This paper poses an assignment problem with a nonlinear objective 
function. It ts formulated as an integer programming problem and a 
graph model is used to determine its exact solution. Application of 
the problem in long-range homing in telephone networks is also 
discussed. 


l. INTRODUCTION 


Consider a process in which at each stage an originating center 
must be connected to exactly one of a set of terminating centers. The 
originating center has a load and each terminating center has a 
capacity. The originating center can be connected to a terminating 
center only if the terminating center has enough capacity for its load. 
The originating center’s load and the terminating centers’ capacities 
vary with time (i.e., from one stage to another), but they are assumed 
to be known at all times. The problem is to determine the optimal 
connection configuration at each stage of time. The costs involved are 
the transmission cost and the rearrangement cost. Both costs are 
nonlinear functions of the originating center’s load. The transmission 
cost is the cost of connection of the originating center to a terminating 
center. The rearrangement cost is incurred if, in transition from one 
stage to the next, the connection of the originating center to a termi- 
nating center has to be changed because of insufficient capacity. 

Such an assignment problem may be encountered in many applica- 
tions. One important application arises in the design of hierarchical 
telephone networks.’® The process of connecting a switching center to 
a center in the next level of hierarchy in the backbone route of such 
networks is called homing. The problem of determining the optimal 
homing configuration of a switching center over several stages during 
a study period can be formulated as the above assignment problem.* 
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In such a case the originating center could be an end office and the 
terminating centers could be toll centers (i.e., switching centers in the 
next level of hierarchy). The load of the originating center would then 
correspond to the traffic volume of the end office and the capacities of 
terminating centers would correspond to the switching capacities of 
the toll centers. The transmission cost would be the cost of trunks 
homing the end office on a toll center and the rearrangement cost 
would correspond to the cost of changing the homing configuration. 

In this paper a graph model will be developed for the above nonlinear 
assignment problem. The solution to the problem will be converted to 
determining a shortest path on this graph. The algorithm developed 
for the solution is easily programmable on the digital computer. 


ll. FORMULATION OF THE PROBLEM 


Consider N stages of time indicated by ¢ = 1, 2, --- N. Let TC(t) 
represent the set of available terminating centers at stage ¢. The 
originating center OC must be connected to a terminating center TC; 
in this set at stage ¢t. Let 


five 1 if OC homes on TC; € TC(t) 
*A') =) 0 otherwise. 


(1) 


Indicate the load of the originating center at stage t by S(t) and the 
capacity of terminating center TC; at stage ¢t by C,(t). The costs and 
the constraints of the problem can then be formulated as follows. 


2.1 Costs 


The transmission cost per unit distance is assumed to be a nonlinear 
function f of the originating center’s load, as shown in Fig. 1. It includes 
a fixed cost that is independent of the originating center’s load. (In the 
case of the homing problem in telephone networks, this fixed cost 
would represent the cost of preparing for establishing a transmission 
facility, e.g., laying a cable.) The fixed cost will be incurred only for a 
new connection. For increasing the capacity of an existing connection, 
only the incremental cost will be incurred. 

Thus, the total transmission and rearrangement costs can be for- 
mulated as 

N 


X~  {(fES@)] -— FES — 1) ]xn(t — 1) a(t) de (2) 
t=1 TC,ETC(t) 
where S(0) = 0 and d; is the distance between OC and TC;. Note that 
S(t) is known a priori for ¢ = 1, 2, --- , N. 
Equation (2) includes the rearrangement cost, i.e., the cost incurred 
if in transition from one stage to the next the terminating center 
assignment changes. However, if such a change occurs, the old con- 
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f[S (t)] 


20 24 32 38 
S (t) 


Fig. 1—Transmission cost per unit distance as a function of the originating center’s 
load. 


nection will represent a saving in the cost that must be considered. 
This saving is, in general, a nonlinear function of the rearranged load. 
We call this function g[S(t)], which is obtained empirically for each 
application. Thus, eq. (2) must be modified as follows to represent the 
total transmission and rearrangement costs 

N 


Y= CFES(@)] — FLS(¢ — 1) ]xa(t — 1)} xa(t) de 


t=1 TC,ETC(t) 
N-1 1 


a) ae > 


t=1 TC,ETC(t) 2 


E[S(t)]|xe(é + 1) — xa(t)|. (3) 


The reason for including the coefficient % in the second part of eq. (3) 
is as follows. If in transition from one stage to the next the terminating 
center assignment changes from TC; to TC;:, the absolute value term 
will contribute 2 instead of 1 (1 for & and 1 for k’). 


2.2 Constraints 


Since the originating center must be connected to exactly one 
terminating center at each stage, we must have 


x(t) =1 fort=1,2,-.--.,N. (4) 
TC,ETC(t) 


Also, the load of the originating center cannot exceed the capacity of 
the terminating center to which it may be connected; thus 
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S(t)x.(t) = Cr(t) for all TC, € TC(t),t=1,2,---,N. (5) 


The problem can then be formulated as follows. Minimize the 
objective function (3) subject to constraints (4), (5) and 


x(t) =O0Oor1 for all TC, € TC(t),t =1,2,---,N. (6) 


ll. A GRAPH MODEL FOR THE PROBLEM 


The above problem is a nonlinear integer programming problem.” 
For most practical applications this problem will have a considerable 
number of variables and constraints. For example, for 10 stages and 8 
terminating centers there will be 8 X 10 = 80 variables and 10 + 8 x 
10 = 90 constraints in the problem. The nonlinearity of the cost 
function and the large number of variables and constraints render the 
available standard integer programming techniques (such as the 
branch and bound method’) impractical for the solution of this prob- 
lem. In this section a graph model will be developed for the problem, 
which aids in obtaining an exact solution for it. 

Define a matrix A whose rows and columns correspond to stages 
and terminating centers, respectively. The (¢, k) element of matrix A 
is defined as 


He By = i if OC can be connected to TC; at stage t 


0 otherwise. 


(7) 


Element a(t, k) of matrix A can easily be obtained by determining 
whether TC; € TC(t) has enough capacity for the originating center at 
stage t. Hence, constraint (5) will be used in the construction of matrix 
A. To incorporate constraint (4) and to perform the required minimi- 
zation, define a directed graph® G whose nodes are elements 1 of matrix 
A. The arcs of G connect node pairs in consecutive rows of A in the 
direction that ¢ increases. Then each path in G that connects a node 
in the first row of matrix A to a node in its last row forms a feasible 
solution to the problem where each node (é, k) of graph G included in 
such a path corresponds to x;(t) = 1. Note that since exactly one node 
in each row of matrix A is included in such a path, constraint (4) will 
be satisfied. 

Costs will now be assigned to the nodes and the arcs of graph G in 
such a way that the solution of the problem converts to solving a 
shortest-path problem on G. The cost associated with a vertical arc in 
graph G (i.e., an arc connecting a node pair in the same column of 
matrix A) is zero. A cross arc of graph G (1.e., an arc connecting a node 
pair in different columns of matrix A) from row ¢ to row ¢ + 1 has cost 
—%g[S(t)]. Each node of graph G corresponding to terminating center 
TC; and stage ¢ is assigned the cost { fj S(¢)j — f{/S(é— 1) jxz(¢ — D}dze. 
If the arc connecting the predecessor node of (¢, &) to it is a cross arc, 
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then the cost of node (¢, k) is f[S(t)]d,. If the arc connecting the 
predecessor node of (¢, &) to it is a vertical arc, then the cost of node 
(t, k) is {f[S(é)] — fES(t — 1)]}d,. With these assignments, the costs 
of the nodes and the arcs in graph G correspond, respectively, to the 
first and the second parts in expression (3). Hence, to solve the 
problem, a minimum cost path in graph G from nodes in the first row 
to nodes in the last row of matrix A must be found. 


IV. AN ALGORITHM FOR THE SOLUTION 


Since graph G includes no cycles, a labeling method similar to that 
used in the “shortest-path algorithm”® may be employed to determine 
paths of minimum cost. Label each node in the first row of A by its 
corresponding cost. The labels of nodes in the other rows of matrix A 
will be determined according to the following rule: 


Node label of its 


Label of Min predecessor node 


node (t, 2) all predecessor nodes + cost of the arc (8) 
connecting them 


+ proper node cost 


where 
f{S(t)]d: if the arc 
arriving at node (t¢, k) is 
proper node cost _ | across arc, (9) 
of node (¢, 2) {f[S(é)] — f[S(¢ — 1)]}d: if 


the arc arriving at node 
(t, k) is a vertical arc. 


With the above labeling method, the label of each node will be the 
minimum cost of path(s) in graph G connecting nodes in the first row 
of matrix A to it. An optimal solution to the problem is obtained by 
finding the node(s) with minimum label in the last row of matrix A 
and determining its predecessor(s) from which the label was produced 
and continuing to the first row of matrix A. Each node thus obtained 
will then identify the proper terminating center assignment at the 
corresponding stage. 

The above method is essentially a forward dynamic programming 
technique.’ It can also handle the case where a terminating center has 
been initially assigned to the originating center. In such a case, simply 
adjoin a row on top of the first row of matrix A such that all its 
elements are zero except the one corresponding to the initial termi- 
nating center assigned to the originating center, which is 1. In that 
case all paths in graph G will originate from this new node (and its 
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STAGE t=0 STAGE t= 1 


Tc TC Tc 
Fae 2130 [*¢2],5 4J0 





(eo) 


(a) (b) 


STAGE t=2 STAGE t=3 


: fea 
[7e4]6 2130 3} 36 4140 TO N65 45 TC 3| 4} 40 


(c) (d) 


Fig. 2—Load, capacities, and distances in the example for stages ¢t = 0 through ¢ = 3. 


label can be assumed to be zero). Note that for cost calculation in such 
a case, S(0) will have its actual value, not zero. 


V. EXAMPLE 


We will illustrate the above method by applying it to a problem 
involving three stages, ¢t = 1, 2, 3, and four terminating centers TC, 
k = 1, 2, 3, 4. Assume that initially (i.e., at t = 0) terminating center 
TC, has been assigned to the originating center. This is illustrated in 
Fig. 2a where the load of the originating center and the capacities of 
the terminating centers (in proper units) are indicated adjacent to 
them. Also, the distance between the originating center and each 
terminating center is indicated in Fig. 2a (i.e., d; = 6, dz = 3, d3 = 4, 
d, = 5 in proper units). Figures 2b, 2c and 2d show the originating 
center’s load and the terminating centers’ capacities in stages 1, 2 and 
3, respectively. Note that 


S(0) = 20, S(1) = 24, S(2) = 32, S(3) = 38. 


This configuration results in the matrix A shown in Fig. 3a. The 
corresponding graph G in which nodes are distinguished by circles and 


arcs are shown by directed links between the nodes is shown in Fig. 


9WL Aaay £. +hat th + vh 
ob. Assume, for convenience, that the cost characteristic shown in Fig. 


1 does not change with time and that 
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TC, TC, TC 3 TC, 





Fig. 3—Construction of the optimal solution in the example. (a) The matrix. (b) The 
graph with costs of arcs and nodes. (c) Node labels and the optimal path (in heavy 
lines). . 


f(20) = 4, £(24) = 7, f(82) = 11, £(38) = 18, 
as indicated in Fig. 1. Also assume that 
&(20) = 6, g(24) = 8, g(32) = 10. 


From the above information the proper node costs can be calculated. 
For example, for node (1,1) (corresponding to stage ¢ = 1 and termi- 
nating center TC) the proper node cost is 


f[S(1)]di = (7)(6) = 42, 
and for node (1,2) (corresponding to stage ¢ = 1 and terminating center 


TC) it is 
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{f[S(1)] — FLS(0)]}d2 = (7 — 4)(3) = 9. 


The costs of arcs and the proper node costs in graph G are written 
adjacent to them in Fig. 3b. Note that all the vertical arcs have zero 
cost and all the cross arcs from any stage to the next have the same 
(negative) cost. The node labels obtained by the above procedure are 
shown in Fig. 3c. The path distinguished by heavy lines in Fig. 3c is 
the minimum-cost path corresponding to the optimal solution. Thus, 
the optimal assignments are as follows: 






terminating 
center 
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In this paper we consider a physical model in which a buffer 
receives messages from a finite number of statistically independent 
and identical information sources that asynchronously alternate 
between exponentially distributed periods in the ‘on’ and ‘off’ states. 
While on, a source transmits at a uniform rate. The buffer depletes 
through an output channel with a given maximum rate of transmis- 
sion. This model is useful for a data-handling switch in a computer 
network. The equilibrium buffer distribution is described by a set of 
differential equations, which are analyzed herein. The mathematical 
results render trivial the computation of the distribution and its 
moments and thus also the waiting time moments. The main result 
explicitly gives all the system’s eigenvalues. While the insertion of 
boundary conditions requires the solution of a matrix equation, even 
this step is eliminated since the matrix inverse is given in closed 
form. Finally, the simple expression given here for the asymptotic 
behavior of buffer content is insightful, for purposes of design, and 
numerically useful. Numerical results for a broad range of system 
parameters are presented graphically. 


1. INTRODUCTION 
1.1 Physical model 

A data-handling switch receives messages from many, say N, infor- 
mation sources, which independently and asynchronously alternate 
between the ‘on’ and ‘off’ state. The on periods as well as the off 
periods are exponentially distributed for each source. These two dis- 
tributions, while not necessarily identical, are common to all sources; 


* Presently at the Department of Mathematics, University of California, Berkeley. 
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also, the sources are mutually independent. Without loss of generality, 
the unit of time is selected to be the average on period; with this unit 
of time, the average off period is denoted by 1/A. Again, without loss 
of generality, the unit of information is chosen to be the amount 
generated by a source in an average on period. In these units an on 
source transmits at the uniform rate of 1 unit of information per unit 
of time. Thus, when r sources are on simultaneously, the instantaneous 
receiving rate at the switch is r. The switch stores or buffers the 
incoming information that is in excess of the maximum transmission 
rate, c, of an output channel. (Thus, c is also the ratio of the output 
channel capacity to an on source’s transmission rate.) 

As long as the buffer is not empty, the instantaneous rate of change 
of the buffer content is r — c. Once the buffer is empty, it remains so 
as long as r = c. We assume that the buffer is infinite and that the 
following stability condition is satisfied: 


ls < 
c(1 + A) 


The left-hand side is the traffic intensity, p. 

Discussions with A. G. Fraser’ suggest that the above is a useful 
model for a switch in a computer network. In such an application, the 
output channel rate may be in the range from 5 kb/s to 56 kb/s. For 
one specific source type, the slow terminals, the message rate may be 
taken to be 300 b/s, which gives 16 % and 186 % as the extreme values 
of c. A representative value of \ for this source type is 0.4, as computed 
from the fact that A/(1 + A) is the long term on time fraction. The 
stability condition is satisfied with N as large as 3.5c. Other sources, 
such as screen terminals and computers, will have quite different 
statistics. In the interests of generality, we have not placed any further 
restrictions on the system parameters. | 

We first derive in a straightforward manner the set of differential 
equations that governs the equilibrium buffer distribution. We obtain 
a set of mathematical results that renders trivial the computation of 
the distribution and its moments and thus also the waiting time 
moments. The main result explicitly gives all the system’s eigenvalues. 
This is achieved by using the generating function method. Insertion of 
the boundary conditions in the differential equations requires the 
solution of a matrix equation, which in many cases of practical interest 
is dimensionally quite large. However, even this step is eliminated 
since the matrix inverse is given in closed form. Finally, simple expres- 
sions for the moments of the distribution and the asymptotic behavior 
of buffer content are obtained. 

The physical model described above is related to the model in our 
primary reference, a powerful paper by L. Kosten.” Kosten’s model is 


1. (1) 
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a limiting case of our model with N — «, X — 0 in a manner so as to 
give a finite traffic intensity, p. Also, pooling the instants of commence- 
ment of the on periods of all the sources yields, by assumption, a 
Poisson process. The above physical model and its variants have also 
been proposed in other papers.*® 


1.2 Motivations and discussion of results 


Two broad questions supply most of our motivation. In this connec- 
tion we acknowledge the benefit of several discussions with our col- 
league A. G. Fraser. The first question concerns the right buffer size to 
use for a predetermined number of sources and grade of service. The 
other question, which is of operational significance, concerns the 
selection of the maximum number of sources to be allowed in the 
system, the reasoning being that the incremental source disproportion- 
ately affects the grade of service for all the sources. 

The study of these questions requires that the number of sources in 
the system, N, be finite. We also examined the conventional belief that 
the traffic intensity is a reliable indicator of overflow probabilities. 

We would like to draw the reader’s attention to an important aspect 
of our problem, namely, numerical stability. Underlying this problem 
is the fact proven below that the set of linear differential equations 
governing the behavior of the equilibrium probabilities [see eq. (8)] 
has ‘unstable’ eigenvalues. (This is cause for calling the system of 
equations ‘inherently unstable.’) Thus, if the boundary conditions are 
such that any of these modes are excited, then the solution grows at 
an exponential rate. In the mathematical model this does not happen. 
However, the situation during computation is quite different. The 
inevitable errors, no matter how small, incurred during numerical 
integration are liable to excite the unstable modes and lead to solutions 
that blow up. 

The above observations apply as well when the Laplace transforms 
of the equilibrium probabilities are available and are to be numerically 
inverted. The boundedness of solutions is, in principle, obtained by 
the exact cancellation of unstable factors in the numerator and denom- 
inator of the transform. Of course, a straightforward numerical inver- 
sion cannot be expected to preserve this feature. 

Therefore, it appears inevitable that any method that counters the 
inherent instability must depend on the a priori segregation of the 
stable modes from the unstable modes. This in turn depends on the 
availability of complete information on the eigenvalues and eigenvec- 
tors—information that is generally costly to obtain. We compose the 
solution to eq. (8) in the form 

F(x) = YY Ave”, x=0, (2) 


t:Rez,;<0 
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where the z,’s appearing above are a subset of the eigenvalues, and the 
coefficient vectors {A;} depend on the boundary conditions and eigen- 
vectors. While the problem of numerical stability does not arise when 
the solution is computed in the above form, its effectiveness depends 
on the efficiency of the computation of the eigenvalues and coefficient 
vectors. An example of the efficiency achieved is that we are able to 
obtain all the eigenvalues by solving only a set of quadratic equations. 

We should mention that Kosten’ is fully cognizant of the problem of 
numerical stability. Kosten’s solution method consists of obtaining the 
initial conditions and then numerically integrating the differential 
equation while continually filtering away the component of the nu- 
merical solution that exists in the span of the eigenvectors associated 
with the unstable eigenvalues. 

A noteworthy feature of our primary solution method is that it 
manages to avoid requiring the numerical solution of matrix equations. 
This is achieved by avoiding the direct procedure (see Section III) 
which requires the solution of a dense set of [c] + 1 linear algebraic 
equations.”* Instead, we require the solution of a typically much larger - 
set of N — [c] linear algebraic equations which, however, we obtain in 
closed form. 

Numerical results are discussed in Section VI. We observe substan- 
tial departures from Kosten’s results in cases where N is small. More 
generally, we observe for identical traffic intensities, rather different 
probabilities of overflow for different values of N. We graphically 
demonstrate the quite acceptable quality of an approximation to the 
overflow probabilities provided by a relatively simple asymptotic for- 
mula. The formula states that, for x large, the probability of buffer 
content exceeding x behaves as Ae”™, A some constant and 

a (3) 
1—c/N 
The positive parameter r is thus, like traffic intensity, a predictor of 
overflow probabilities. Small values of r may be associated with high 
probabilities of overflow and low grades of service. 


1.3 Mathematical model 


If at time ¢ the number of on sources equals 1, two elementary events 
can take place during the next interval At, i.e., a new source can start 
or a source can turn off. Since the on and off periods are exponentially 
distributed, the respective probabilities are (N — 1)AAt and 1At. Com- 
pound events have probabilities 0(At”). The probability of no change 
is 1 — {((N—i)A + i} At + O(At’). 


* We let |[c| denote the integer part of c. A tacit assumption is that c < N, since 
otherwise the buffer is always empty. 
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Let Pi(t, x), 0 =is N, t= 0, x = 0, be the probability that at time 
t, i sources are on and the buffer content does not exceed x. Now, 
Pi(t + At, x) = {N — (i — 1) }AAEPi-1 (8, x) + (0 + LI) AtPi41(, x) 
+ [1 — {(N — i)A + i}AC]Pi{t, x — (i — c)At} + O(At?). (4) 
Passing to the limit At > 0: 
ote (0) (SEA aNP es 
—{(N-—iA+}Pi+ © +1)Piti. (5) 
We are interested only in time-independent, equilibrium probabilities, 


(6) 


equilibrium probability that z sources are on 
and buffer content does not exceed x. 


Therefore, we set dP;/dt = 0 and obtain, for i € [0, N], 


F(x) A 


(i — c) =. (N-—it LAF — ((N-iA+ DR + G+ Fin, (7) 


where the understanding is that F; = 0 if i is not in the stated interval. 
In matrix notation, 


d 
D — F(x) = MF(x), x= 0, (8) 
dx 
where D = diag {—c, 1 —c,2—c¢, ---,N-—c} and 
—NA 1 
NA -{(N-DA+F+) 2 


(N — 1)A —{(N — 2) + 2} 3 


2. — {A+ (N- 1)} N 
r 


Note that the initial conditions to the differential equations are as yet 
unspecified. Considerations relating to their determination may be 
found in Section III. 
Let* 
G(x) 4 Pr(buffer content > x) = 1 — 1’F(x), x= 0. (9) 


We refer to G(x) as the ‘probability of overflow beyond x’, or, loosely, 
as just the ‘probability of overflow.’ Also observe that 


* We let 1 denote the vector with unity for all its components and prime denote 
transposition. 
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N i <j< 
Fees) = as (7) 0<i<N, (10) 


since F;(co) is the probability that 2 out of N sources are on simulta- 
neously. Obviously, 


N 
¥ Fi(w) = 1. 
i=0 


In the analysis to follow, we assume that c is not an integer. (When 
c is an integer, one of the differential equations in (7) degenerates to 
an algebraic equation that may be used to eliminate one of the 
unknown components of F.) 

In Ref. 2 the elements of the matrix M below the diagonal are 
identical, i.e., independent of row number, and the diagonal element is 
accordingly adjusted to give column sum 0, as in (8). 

The work of Arthurs and Shepp considers various models related to 
the one considered here; the emphasis of their analysis is on obtaining 
Laplace transforms of the probabilities.® Cohen’ obtains a broad range 
of results for the case c = 1. 


ll. EIGENVALUES AND EIGENVECTORS 
2.1 Computing the eigenvalues 


Let z be some eigenvalue of D~'M and let ¢ be the associated right 
eigenvector. That is, 


zD¢ = Me. (11) 
Equation (11) is also 
2(i— c)oi = ANN + 1 — LU) Gi-1 — ((N — 1)A + Ot} di + (0 + 1 i41, 
O<i=<=N. (12) 
Let ®(x) denote the generating function of 4, i.e., 
N 
D(x) AY pix’. (13) 
i=0 
By multiplying (12) by x’ and summing over t we expect to obtain an 
equation in ®(x) and ®’(x) [for example, >) ix’¢; = x®’(x)]. In fact, 
O'(x) ze — NA + NAx (14) 
P(x) Ax? + (z+1—A)x—-1 


In preparing to solve the differential equation, we define 7; and rz to 
be the distinct real roots, 71 > 0 > re, of the quadratic in the denomi- 
nator of the right-hand side, i.e., 


ails [en ae 2a RRO eae, a sam 4 
r= {-(z+1—A) + v(z2+1—A)* + 4A}/2A (ida) 


ry = {-(z2 #1 —d) — Viz +1 —A)* + AA} /2A. (15b) 
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Equation (14) may now be written as 


0’ (x) C1 C2 











(x) Tiga Ga 2 
where the residues are computed to be 
ce=N-c (17a) 
oe ze ——e (17b) 
The solution to (16) is 
B(x) = (x — r1)(x — 72)", (18) 


where, as in the rest of the paper, we have assumed on = 1. 

There is an observation on (18) to be made that is central to the 
present derivation. Observe that by its definition in (13), B(x) is a 
polynomial in x of degree N. Since 7; and rp are distinct, this is possible 
if and only if c,, defined in (17b), is an integer in [0, NV]. Denoting this 
integer by k we get 


O(x) = (x —n)*(x —172)**, K=0,1,---,N. (19) 


If in (17b) we write k for ci, use (15) to substitute expressions for 7 
and 7 — re, rearrange, and square, then we obtain the following family 
of quadratics in the unknown eigenvalue z, 


A(k)z? + B(k)z + C(k) =0, R=0,1,---,N 


where, 


A(k) & (N/2 — k)” — (N/2 - ¢)? 


Bik) 4 2(1 — A)(N/2 — k)? — N(1 + A)(N/2 — ©) (20) 
C(k) & —(1 + A)*{(N/2)? — (N/2 — k)”}. 





We denote by z{"? and z¥) the two roots associated with the kth 
quadratic. 

To recapitulate, we have shown that all the roots of the above family 
of N + 1 quadratics are eigenvalues, as defined in (11). The reader will 
find in the following section an enumeration of the properties of the 
roots and eigenvalues. 

We observe that the above argument takes the place of the argument 
“®(x) should be an entire function of x” employed by Kosten.” 


2.2 Properties of the roots of the quadratics 


The theorem below, which is presented in conjunction with Fig. 1, 
is a collection of various properties of the roots. 
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A (k) 22 +B (k) Z+C (k) A (k) Z2 +B (k) Z+C (k) 





INCREASING INCREASING 
k (k $N/2) k=\cy+1 k (k $ N/2) 


(a) (b) 


Fig. 1—Sketches of graphs of quadratics, N odd. If N is even, the quadratic for k = 
N/2 has a repeated real root. (a) c< N/2. (b) N/2<c. 


Theorem 
(t) The quadratics for k and k’ are identical when N/2 — k = 

— N/2. 

(it) For each k < N/2 the corresponding quadratic has two real 
and simple roots. When N is even and k = N/2, the corresponding 
quadratic has a real repeated root. 

(iii) k'’ <kR=N/2—>A(k’)z? + Blk’)z + Clk’) > A(k)z? + Blk)z 
+ C(k), Vz. 

(iv) The roots of the quadratic corresponding to any k are distinct 
from those of the quadratic corresponding to any k’, provided k’ <k 
< N/2. 

(v) Ignoring the multiplicities, there are N — |c] negative roots, 
1 root at 0 and |c| positive roots. (If the inequality in the stability 
condition is reversed, then there is 1 less negative root and 1 more 
positive root.) 

(vi) The set of eigenvalues coincide exactly with the set of roots of 
the quadratics. 

(vit) The largest negative eigenvalue is —(1 + AX — NA/c)/ 
(1—c/N). LJ 
The proof of the theorem is given in the appendix. 

We will employ the following convention for the eigenvalues: 


ZN-[cJ-1S °° SS Z1<2< 2nN= O<zn-1< +00 < ZN-|c]- (21) 


With this convention, z, and zy-, are roots of the kth quadratic, i.e., 
(2h, Zn-r} = {Z1 92 qs 
Since we shall later require the stable or negative eigenva 


+ 
be explicit about their computation. Let N be first odd and employ the 
notation z{*’ < z$"). The stable subset is given in braces. 


1, 
i 


wAA 
UCD; 
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If c<WN/2: 
z{) = [-B(k) — VB*(k) — 4A(k)C(k)]/2A(k), 


O0<k<lc].| (22a) 
2{3 = [-B(k) = VB%(k) — 4A(R)C(k)]/2A(R), 
lcJ+1Sk<N/2 
and if N/2 <c: 
{2 = [-B(k) — VB*(k) — 4A(k)C(R)]/2A(R), 
0<k=N-[c]—1.}. (22b) 


When N is even, the only change is that the set in (22a) is augmented 
by —N(1 + A)/(N — 2c), which is one of the repeated roots of the 
quadratic associated with k = N/2. 


2.3 Eigenvectors 


We mention the procedure for obtaining the eigenvectors from the 
eigenvalues. Given an eigenvalue z, we compute in order, using (15) 
and (17), the quantities r;, r2, and k. These are used in (19) to yield 
the coefficients of the given polynomial and thus the eigenvector 
coefficients. Therefore, for the ith component of the eigenvector* 


k 
gi = (IY (*) @ 7 ; rip) Q<i<N. (23) 
j=0 \J L—J 

Of particular subsequent interest are the eigenvectors oy and ¢o, 
corresponding respectively to the eigenvalues zy = 0 and 2 = (1 + 
A — NA/c)/(1 — c/N). The vector @n may either be obtained directly 
or by noting that it is the appropriately normalized vector of equilib- 
rium probabilities F() given in (10). Therefore, 


, 


_i N t N 
dag] BAR (T)M ood 


N 
and l’@yv = (+) . (24) 


The vector @o is obtained by following the procedure mentioned in the 
previous paragraph. We find that 

aot 1 
~X N/e-V 


N 
®(x) = {3 + (> = 1) , (25) 


* We let @; and ¢i, respectively, denote the ith eigenvector and the ith component 
of the generic eigenvector. 


k=N, 


n=1-—-, re 
c 
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Hence, 


N-~i 
(0)i = (7) (3 — 1) ; O<i=N (26) 


U 
oe NN 
16 = (2) : 


This example serves to illustrate a noteworthy point: Recall that for a 
specific k we may obtain an eigenvalue z using (20); for the same z we 
may obtain k’ from (15) and (17), as outlined in the first paragraph of 
this section. It may be that k ¥ k’; however, it is always true that | N/ 
2—k|=|N/2—k’|. This should not be surprising in view of statement 
(t) of the theorem, and it is due to the operations, including squaring, 
that allow us to go from (15) and (17) to (20). 


and 


2.4 Left eigenvectors 


Sometimes, as in Section 3.3 below, not only the right, but also the 
left, eigenvectors 7, 


zV/D=wWM (27) 


are required to be known. It is reasonable to expect that the procedure 
in Section 2.1 can be repeated to obtain the generating function of the 
left eigenvectors, but we have not found this approach to be tractable. 
However, the procedure outlined below may be used. 

There is a diagonal matrix + that symmetrizes M, i.e., 


tT Mt = (7 'M7)’. (28) 


1/2 
na {n (¥)} , OSi=N, (29) 


where 7; is the ith diagonal element of 7. 

Define y and @ to be the left and right eigenvectors obtained when 
D is replaced by t ‘Dt and M by 7 ‘Mr. It is easy to see from the 
defining relations that 


@=T' and ~= ry. (30) 
Now notice the important fact that since 7 *Mr is symmetric, it has 
identical left and right eigenvectors. Hence, 


In fact, 


y= 6 
or, component-wise, 
l /N\ ail 
es poe Osi1=N. (31) 





Thus, the left eigenvector may be obtained from the right eigenvector. 
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il. THE SOLUTION 


The solution to the differential equations in (8) with F(0) = f, can 
be written as 
N wWiDf 
F(x) = 7i* ——__ qh, 32 
() Py . oiDy; ? (32) 


It is clear, however, that as only bounded solutions are allowed, 


N-(c]-1 Y/Df 

F =F + 7i* —___ &,, 33 
(x) = F(c) » oe Da, @ (33) 
where, according to our convention in (21), the z,’s appearing in the 
above expression are all negative. The term F() in (33) is identical to 
the i = N term in (32). Recall that F() is already known, as shown in 

eq. (10). With appropriate identification, (33) also may be written as 

N-[e]-1 


F(x) =F(~)+ YY e* agi. (34) 
i=0 


Our primary solution method developed below in Sections 3.1 and 
3.2 depends on the explicit solution of the coefficients a; in the form 
appearing in (34). In Section 3.3 we give a second, contrasting, method 
in which the initial condition vector f is numerically solved and 
substituted in (33). 


3.1 A key property of the solution at the boundary x = 0 


If the number of sources on at any time exceeds c, then the buffer 
content increases and the buffer cannot stay empty. It follows that 


F (0) = 0, leJ +1 SisNn. (35) 


By supplementing (35) with the tri-diagonal structure of the matrix 
D~'M we may make further deductions regarding the behavior of F(x) 
when x is small. Observe that an application of D’'M on F(0) will 
diminish by 1 the number of trailing elements that are zero, and that 
each additional application will have the same effect until 
(D"*M)**!"1#(0) has only its last component equal to zero and 
(D~'M)*'“F(0) has none. Thus, 


{(D7'M)/F(0)}; = 0, le} +1+ 7S. (36) 


Now recollect that on account of the governing differential equations 
for F(-) in (8), 


F‘(0) = (D™'M)’F(0). (37) 
Thus, from (36) we find that 
FYO)=0, Le] +1+s7si, (38) 


and, in particular, the following relation, which we shall find most 
useful: 
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FY(0)=0,  j=0,1,---,N—[e]—1. (39) 


Thus, not only is the event “all sources are on and buffer is empty” of 
probability zero, as already is known from (35), but the growth of the 
probability is also slow when the buffer content is small. 


3.2 Procedure for obtaining the solution 


We proceed to obtain the coefficients {a;} in the solution expression 
(34). Recall that by convention {@;}~ = 1, so that from (34) we find 
that 


r N N-|c]-1 
Fy(x) = (5) + x aie**, x=0. (40) 
The above, taken with (39), implies the following set of equations: 
N-[c]-1 x N 
2 (z;)‘a; = -(5) Soi, OsisN-l[c]—-1. (41) 


Equation (41) in matrix form is 


N 
Se fe 2 
Va (; 7 :) e, (42) 
where Vi; = (z;)’, a = (ao, Qi, +++, an-{cj-1)’ and e = (1, 0, --- , 0)’. 


The key observation is that V is a Vandermonde matrix. Well known 
results on such matrices allow us to solve (42) explicitly. Note that V 
is nonsingular because the eigenvalues {z;} are distinct,® as previously 
established in the theorem (see Section 2.2). Therefore, 

eee | (2i:zj). (43) 
O<i=j=N-[c]-1 
This formula, applied to the minors, which also are related to 


Vandermonde matrices, gives 
N N-|c]-1 2; 
; 0sj=N-l[c]—1.| (44) 





i-0 Zi 2 
ie 


To summarize, the above procedure for obtaining the equilibrium 
probabilities F(x) and the probability of overflow G(x) is based on 
using the expressions 

N-[cJ]-1 


F(x) =F(o)+ Y e**aidi (45) 
i=o 
and 
N-te]-1 
G(x) =—- x e**a;(1’@i). (46) 
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Only the stable eigenvalues appear in the above forms and they are 
explicitly given in (22); {;} is obtained from the generating function 
in (18) and the coefficients {a;} appear in (44). Note that 1’¢ = ®(1), 
so that {¢;} need not be computed explicitly to compute G(x). F(0) is 
given in (10). 


3.3 An alternative procedure for obtaining the solution 


Under certain conditions the following procedure may be considered 
a viable alternative to the one described above. Here we compute 
F(0) = f and use it in the solution series (33). 

Recall from (35) that only the leading [c] + 1 elements of f are 
nonzero and need to be computed. Precisely the same number of 
equations are forthcoming by requiring of the initial conditions that 
the Lc] unstable modes are not excited (compare with Kosten’s “illegal 
eigenvalues”) and that F() is normalized, i.e., 


WDf=0 i=N-1,N-2,---,N-—[e] 


Nr F 


In matrix form, 
f’ = {f1, 0’} (48) 
Nx 
Af; = {- 
and Af; ( e+ the , 





where the coefficients in (47) multiplying the unknown f have been 
arranged to form the matrix A in (48). We note parenthetically that A 
is not sparse. The ([c] + 1)-dimensional matrix equation in (48) has to 
be solved. 

Recall that our primary procedure in Section 3.2 and the one shown 
above require solutions of matrix equations. However, in the former 
case we were able to explicitly obtain the solution even though the 
dimension there, N — [| c], is typically much greater. In the absence of 
an explicit inverse for A, we expect the above procedure to be useful 
only for small c. 


iV. ASYMPTOTICS 
4.1 Probability of overflow 


Here we examine the behavior of G(x), the probability of overflow 
beyond x, for large values of x. The asymptotic formulas obtained are 
useful for the following reasons: As can be seen from the numerical 
results in Section VI, they often describe the system behavior rather 
well in all but the regions of lesser importance, where x is small; also, 
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the analytic formulas are simple, even though they contain the essen- 
tial information. 

Since the form of the solution in (34) is a sum of exponential terms, 
the departure of F(x) from F() will be dominated by the exponential 
with the largest exponent. Hence,* 


F(x) — F(%) ~ aogoe™ (49) 
and 
G(x) ~ —ao(l’oo)e™, (50) 


where —r(=2o) is the largest negative eigenvalue of D~'M. Using 
statement (vii) of the theorem, we find that 


(l1+A—NA/c) (1+A)(1- op) 
r= 5s (51) 
1—c/N 1—c/N 
where in the latter, more suggestive form, p is the traffic intensity. The 
coefficient a@) and the eigenvector @ are given in (45) and (26). 
Collecting terms, we find that 


(52) 





In the event that the alternative procedure given in Section 3.3 is 
used, then the following asymptotic formula is more relevant. 


G(x) ~ Ae 
where 
a N-ec - p 1 lel (ec — i) fi 
a fe 2c + os) E — p) “N= 4 yi A(N/e — 1)" (62) 


4.2 Limiting equations for an infinite number of sources 

Here we bridge some of the results presented in this paper and 
Kosten’s results. We will show that some of Kosten’s important expres- 
sions are obtained by passing to the appropriate limit, namely, 


N— , A—0 and NA= i. (54) 


(Our notation is close to Kosten’s with one notable exception: A is 
Kosten’s A.) The results obtained in the limit may be interpreted to be 
applicable when the number of sources grows large and the fraction of 
time that each source is on decreases in such a manner that the traffic 
intensity approaches the fixed constant A/c. 

In this section we follow Kosten and normalize the generating 


* By a(x) ~ b(x) we mean that a(x)/b(x) = 1 as x— ©. 
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function to yield ¢o = 1 (rather than, as in the rest of the paper, én = 


1), so that 
x k ie N-k 
O(x) = (1 _ =) (1 — =| ; (55) 
Tr) r2 


where r; and r2 are as in (15). It may be shown that, in the limit (54) 
r)— 1/(z + 1) (56) 
rg > —N(z + 1)/d; (57) 
and, furthermore, on using (55), 
B(x) > e?/D (1 — x(1 + z)}4 (58) 


which is Kosten’s expression for the generating function. 
To establish another correspondence, observe that on substituting 
(17b), the expression for 7; and re in (15), we obtain 


ze — NA+ NA[—(z +1—A) + vV(z+1-—A)?+ 4A]/2A 
Vie@+1—A)?+4a 


The limit of the right-hand side may be obtained and found to give 
Kosten’s key equation 


k= z(ez +e—A)/(z+ 1)”. (60) 


k= (59) 


The correspondence in the eigenvector components is particularly 
illuminating, showing that 


ere BACIN AR) po 552) yer, 
wor § ()(XB) carey 


and, from (23), that 


ant k ; l i-2j 
wey (No 0 (25) d (61) 


A final correspondence concerns the important rate parameter r 

appearing in the asymptotic formulas in Section 4.1. We find that 
(1 + A)(1 — p) “ 
r=—————_-> 1 --p, 62 
i—c/N p (62) 

where p is simply the limiting traffic intensity \/c. 

We should also mention that certain key results of this paper, such 
as those pertaining to our primary solution method in Sections 3.1 and 
3.2, have no parallel in Ref. 2. 





V. MOMENTS 


We give below expressions for the moments of the equilibrium buffer 
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content, which allows for their easy computation once the elements of 
the exponential series solution in (34) are available. 
We observe that the nth moment 


E(x”) -| x"d {1'F(x)} =n | x" 1G (x)dx. (63) 
0 0 


Since 
N-teJ—1 


G(x) =- > e*a;(1'di), 


i=0 
as we saw in eq. (46), we obtain 
nt “tel! a ,(1’i) 
(—1)"** i=0 ra 
If the alternative procedure given in Section 3.3 is used to obtain 
the solution, then (64) may again be used with the identification 
yiDf 


a; = Wide: (65) 


E(x") = (64) 


VI. NUMERICAL RESULTS 
In Figs. 2 through 5 we have held the source statistic A to a constant 


\ 
S\ ASYMPTOTIC 
FORMULA 


_ ASYMPTOTIC 
FORMULA 


LOG PROBABILITY OF OVERFLOW 





0 4 8 12 16 20 
BUFFER SIZE 


Fig. 2—Probability of overflow vs buffer size with A and c constant. For N = 25, 30, 
40, and 50, the traffic intensity p is 0.43, 0.51, 0.69, and 0.86, respectively. 
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Fig. 3—Probability of overflow vs buffer size with \ and c constant. For N = 65, 75, 
85, and 100, the traffic intensity p is 0.56, 0.65, 0.73, and 0.83, respectively. 
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Fig. 4—Probability of overflow vs buffer size with A and c constant. For N = 150, 175, 
and 200, the traffic intensity p is 0.64, 0.75, and 0.86, respectively. 
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\=04 
c = 133.333 
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Fig. 5—Probability of overflow vs buffer size with A and c constant. For N = 350, 375, 
and 400, the traffic intensity p = 0.75, 0.80, and 0.86, respectively. 
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Fig. 6—Probability of overflow vs buffer size wiih traffic intensity p and c constant. 
For N = 5, 10, 20, and 40, the parameter A is 0.18, 0.08, 0.04, and 0.02, respectively. The 


curve for N = & is from Ref. 2. 
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c= 16.666 
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Fig. 7—Probability of overflow vs buffer size with traffic intensity p and c constant. 
For A = 0.6, 0.4, 0.2, and 0.15. The number of sources N = 33, 44, 75, and 96, respectively. 
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Fig. 8—Probability of overflow vs buffer size with traffic intensity p and c constant. 
For A = 0.6, 0.4, 0.2, and 0.15, the number of sources N = 267, 350, 600, and 767, 
respectively. 


value, 0.4. Each figure has a distinctive value of c, the ratio of output 
to input transmission rates. The four values of c are chosen for the 
cases where an on source transmits at 300 b/s and the output channel 
rates are 5 kb/s, 10 kb/s, 20 kb/s, and 40 kb/s. These figures show 
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ASYMPTOTIC RATE r=0.5 
c= 16.666 
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Fig. 9—Probability of overflow vs buffer size for constant asymptotic rate r [see eq. 
(51)] and c. For p = 0.6, 0.65, 0.7, and 0.8, the number of sources N = 127, 85, 63, and 41, 
respectively. 
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Fig. 10—Probability of overflow vs buffer size for constant asymptotic rate r and ec. 
For p = 0.6, 0.65, 0.7, and 0.8, the number of sources N = 507, 338, 253, and 164, 
respectively. 
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rather clearly the effect of incremental sources. For example, we see 
from Fig. 2 that for a fixed grade of service as given by the probability 
of overflow = 10~°, a 33-percent increase in the number of sources from 
30 to 40 requires about a 300-percent increase in buffer size from 2 to 
8. Recall from the discussion in Section 1.1 that the unit of information, 
and thus of the buffer as well, is the amount generated by one source 
in the average on period. 

Observe in Figs. 2 through 5 the generally acceptable quality of the 
approximation to the probability of overflow provided by the asymp- 
totic formula in eq. (52). 

In each of Figs. 6 through 8 we have a constant traffic intensity, p, 
and a constant ratio of transmission rates, c. Note in particular that 
any two curves will have different source statistics A and different N. 
The figures in this series illustrate the difference between the model 
considered in this paper and Kosten’s limiting model. The figures also 
demonstrate rather emphatically the limitations of using only the 
traffic intensity as a predictor of overflow behavior. For example, in 
Fig. 7 we see that for constant traffic intensity and buffer size = 20, 
probability of overflow varies from about 10~* to about 107°, depending 
on A. 

In Figs. 9 and 10 we examine the proposition that the rate parameter 


50 


r=0.2 
PROBABILITY OF OVERFLOW <10-5 


40 


30 


20 
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c= 8.333 ~_ 
a 


c= 16.666 ~_ — — c= 33,333 
~ 
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Fig. 11—Minimum buffer size required to satisfy traffic intensity p with constrained 
probability of overflow. The constant is A. 
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Fig. 12—Minimum buffer size required to satisfy traffic intensity p with constrained 
probability of overflow. The constant is A. 


r, which gives the slopes of the curves obtained from the asymptotic 
formula, is a useful single index of overflow behavior. Equations (3) 
and (51) give r. The strength and limitations of the index are contrasted 
in Figs. 9 and 10. 

Figures 11 and 12 are motivated by the design problem in which it 
is required to estimate the buffer size needed to meet various traffic 
conditions with a specified grade of service and fixed source statistics. 
The slackening requirements in the buffer size with increasing c, which 
are observed in both figures, denote the economies of scale that stem 
from using higher capacity output channels. In comparing Figs. 11 and 
12 we observe that for the same traffic intensity, buffer requirements 
are less stringent when J is greater, i.e., when the source type is on for 
higher fractions of time. 
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APPENDIX 
Proof of Theorem 


(i) Observe that A(k), B(k), and C(R), as given in eq. (20), depend 
on k only through (N/2 — k)”. 

(ii) Roots are real if B?(k) — 4A(k)C(k) = 0, and distinct as well 
if the expression is positive. View the expression as a function of 
(N/2 — k)”. Observe that it is a quadratic in (N/2 — k)? and, further- 
more, that it is concave since the leading coefficient of (N/2 — k)* is 
A(1 — A)? — 4(1 + A)? < 0. Thus, the minimum of B?(k) — 4A(Rk)C(k) 
for (N/2 — k)? in [0, (N/2)?] is at one of the corner points where the 
respective values are 0 and positive. 

(tit) We claim that 


{A(k’) — A(k)}2*° + {B(k’) — Blk) }z 
+ {C(k’) — C(k)} > 0, Vz. (66) 
To prove eq. (66) we need to observe that 
A(k’) — A(k) > 0 (67) 
and 
{B(k’) — B(k)}? — 4{A(k’) — A(R)}{C(R’) — C(R)} <0. (68) 


(tv) It follows immediately from (iiz) that the graphs of the quad- 
ratics are nonintersecting. See Fig. 1. 
(v) We first consider c < N/2 and later its opposite. Consider in 
turnk=0,0<k<=c,andce<k=WN/2. 
When k = 0, it turns out that C(k) = 0, so that the quadratic reduces 
to 


A(0)z{z + B(0)/A(0)}. (69) 
Hence 
B(O) -(1+A—NdA/c) 
(0) _ _ a 
AO 1—c/N ae 
and 
zy =0. (71) 


DATA HANDLING 1893 


Observe that the stability condition implies that z{” < 0; reversal of 


the inequality in the stability condition gives z{° > 0. 

Now consider 0 < k < c. Observe that A(k) > 0 and C(k) < 0, so 
that z{) <0 < z#’. 

Finally, consider c < k < N/2. It is easy to see that A(k) < 0 and 
C(k) < 0. We show below that B(k) < 0, from which it will follow that 
zi <0 and z$ <0. If \ = 1 then the defining expression for B(k) in 
eq. (20) shows that B(k) < 0. If AX < 1 then the following equivalent 
expression shows that B(k) < 0: 


N 2 N 2 
B(k) = 2(1-A) | (F-*) = (Fe) | 
~2e(2~e)(1-a+ 4). (72) 
Z c 


The above completes the considerations related to c < N/2. 

For the opposite situation where N/2 < c, the case of k = 0 is 
unchanged. For 0 < k < N — ¢, it is easy to see that A(k) > 0 and 
C(k) <0, so that 2{” <0 < 2, 

Now consider N — c< k < N/2. We show below that B(k) > 0 (note 
the contrast with the situation for c < N/2), which when taken with 
A(k) <0 and C(k) <0, which are easy to see, gives 0 < z{”) < z$*. If 
A = 1 then the defining expression for B(k) in eq. (20) shows that 
B(k) > 0. Assume now that 1 < A. Then, 


2 
Bik) = N(i +A) (e-$) +20-m (F-*) 


2 
> N(1 +A) (<- 2) +2a-» (-- 2) 
2 2 
= 2¢ (e-F)(a-a+%B)>0 (73) 
2 c 


(vi) From our derivation of the quadratics it is clear that all roots 
are eigenvalues of D~™'M. As we have isolated exactly (N + 1) distinct 
values for the roots, there cannot be an eigenvalue that is not one of 
the roots. 

(vii) It follows from (iit) and (v) that the largest negative eigenvalue 
is a root of the quadratic corresponding to k = 0. The negative root in 
this case is — (1 + A — NA/c)/(1 — c/N), as shown in eq. (70). 
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Motion-compensated television coders generate data at a nonuni- 
form rate, which is smoothed by a buffer for transmission over a 
channel of constant bit rate. Spatial subsampling is one of the 
methods used to prevent overflow of the buffer, which would otherwise 
occur for scenes with complex motion from frame to frame. In this 
paper we evaluate the effects of spatial subsampling on the perform- 
ance of motion-compensated coders. In particular, we find that, 
although the quality of motion estimation does degrade in the pres- 
ence of subsampling, the degradation is not substantial. Use of 2:1 
horizontal subsampling, for example, results in bit rates that are 50 
percent lower compared with no subsampling for motion-compen- 
sated coders. This percentage is approximately the same for the 
conditional-replenishment coders. Spatial subsampling generally re- 
sults in blurring of the picture. We describe a technique for adaptive 
interpolation that results in blurring of only the unpredictable areas 
of the picture. The subjective quality in the presence of subsampling 
is thus improved considerably. In conclusion, our techniques for 
subsampling in a motion-compensated coder reduce the bit rate 
approximately by the same factor as subsampling in a conditional- 
replenishment coder, but result in a much better picture quality. 


|. INTRODUCTION 


Television signals contain a significant amount of frame-to-frame 

redundancy. Interframe coders attempt to exploit this redundancy by 

(t) Segmenting each television frame into two parts, one part that 

is predictable from the previous data, and one part that is unpredict- 
able. 

(iz) Transmitting two types of information: (a) addresses specifying 

the location of the picture elements in the unpredictable area, and (b) 
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information (usually quantized prediction error) by which the inten- 
sities of the unpredictable area can be updated. 

(tit) Matching the coder bit rate to the channel rate. Since the 
motion in a real television scene occurs randomly and in bursts, the 
amount of information about the unpredictable area will change as a 
function of time. It is transmitted over a constant bit-rate channel by, 
(a) storing it in a buffer prior to transmission to smooth out the 
transmitted information rate, and (b) using the buffer fullness to 
regulate the encoded bit rate by varying the amplitude, spatial, and 
temporal resolution of the television signal. Intensities of the picture 
elements (pels) in the unpredictable areas are transmitted by predic- 
tive coding. In conditional-replenishment coding,’* quantized values 
of frame difference, element difference, and line difference (or a com- 
bination thereof) are transmitted. In motion-compensated coders,” 
estimates of interframe translation of objects are obtained, and more 
efficient predictive coding is performed by taking differences of ele- 
ments from the previous frame that are appropriately translated. The 
translation is equal to the displacement of the object. In our previous 
papers’”’ we described several methods of displacement estimation 
and locally adaptive prediction to reduce the bit rate of interframe 
coders. Displacement estimation methods were recursive, which min- 
imized the motion-compensated prediction error by a steepest-descent 
algorithm. 

As mentioned before, most interframe coders require a buffer to 
smooth the output of the coder for transmission over a channel. One 
method of reducing the size of the buffer or preventing buffer overflow 
is to control the resolution by spatial subsampling. Typically, if the 
buffer starts to fill rapidly, resolution is decreased; resolution is in- 
creased if the buffer is nearly empty. It is not known how to optimally 
control the resolution of the unpredictable area for a given channel-bit 
rate and buffer size. However, many excellent resolution-channel al- 
gorithms have been designed on a trial and error basis. In this paper, 
we are concerned with spatial subsampling in motion-compensated 
coders, for the purpose of 

(t) Investigating to what extent spatial subsampling adversely 
affects the recursive-displacement estimation algorithm 

(it) Modifying the displacement estimator to increase its efficiency 
in the presence of subsampling 

(iit) Evaluating a new algorithm for adaptive interpolation that 
blurs only the unpredictable area (as compared with the “moving 
area”) in a conditional-replenishment coder 

(tv) Presenting simulation results on synthetic scenes with known 
displacement, and real scenes containing complex motion. 

Our simulations are restricted to horizontal subsampling by factors 
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of 2:1 and 4:1. Simulations indicate that spatial subsampling does 
degrade our displacement estimation. However, the degradation is not 
very serious. It is known (and confirmed by our simulations) that 2:1 
and 4:1 subsampling reduces the bit rates of conditional-replenishment 
coders approximately by a factor of two and four, respectively. We 
found that in the case of motion-compensated coders, 2:1 and 4:1 
subsampling reduces the bit rates of a motion-compensated coder by 
similar factors. In most conditional-replenishment coders, subsampling 
blurs the “moving areas” (i.e., pels for which amplitude of the frame 
difference is above a certain threshold); our adaptive interpolation 
algorithm blurs only the unpredictable areas. Thus, improvement in 
prediction reduces the blurred areas, thereby improving the picture 
quality. 


ll. ALGORITHM 


In this section, we describe the modifications to our displacement 
estimation algorithm. It is worthwhile, however, to look at the basic 
algorithm described in our earlier works.’ Let I(x, t) denote the 
intensity of a scene at the kth sample point x; in the scanning order of 
a frame, and let [(x;, t — +) denote the intensity at the same spatial 
location in the previous frame. If the scene consists of an object that 
is undergoing pure translation under uniform illumination, then, dis- 
regarding the background, 


I (Xa, t) ae I(x, ra D, t— T), (1) 
where D is the displacement (two-component vector) of the object in 
one frame interval, r. The pel-recursive algorithm obtains an estimate 
of D (i.e., D) by recursively minimizing the square of the displaced- 
frame difference at the current pel location. The displaced-frame 
difference DFD(-, -) is defined by 

DFD(x;, D) = I(xz, t) — I(x, —D, t — 7). (2) 

The minimization is performed by a steepest-descent algorithm of 
the form 

Diss = Dz — %eVo[DFD(xr11, Di) F, (3) 


where Vp[-] is the two-dimensional gradient with respect to D. Equa- 
tion (3) can be expanded to 


Dest = D, — eDFD (xia, Dp) VI(Kn+1 — Dz, t-— 7), (4) 


where V = V, is the two-dimensional spatial-gradient operator with 
respect to horizontal and vertical coordinates of vector x. We use a 
finite-difference approximation for the gradient, which is formed by 
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element difference, EDIF;, and line difference, LDIF;, using the pel 
closest to the point x,1: — D, in the previous frame. The above 
displacement estimator requires multiplication at each iteration, which 
is undesirable for hardware implementation and is therefore simplified 
to: 


Divi = D, — € sgn| DFD(x:+1, D,) |-sgn| VI(xr41 — D;, t — 7) |, (5) 


where 
-1lifZ<-—T 
sgn(z)= | 0,if|Z|= T (6) 
+1, otherwise. 


The above recursion to update D, is carried out only in the moving 

areas of the current frame, i.e., for those pels where 

+p 

YY | L(x, £) — I(Knsj, t - t)| = Threshold . 

J=—pP 

The motion-compensated coder predicts intensity, J(x;, ¢), using either 
the previous frame intensity, I(x, ¢ — +), or displaced-previous-frame 
intensity, I(x, — Dex t — rt), based on which predictor results in less 
error for certain already transmitted neighbors. We note that if a point 
x — D does not lie on the grid formed by the pels, then interpolation 
is required to evaluate I(x, — D,-1, t — 7). The displacement at either 
the previous pel or the previous line element is used to form the 
displaced-previous-frame prediction of the present pel. This allows the 
receiver to compute displaced-previous-frame predictions without ex- 
plicit transmission of the displacement. If the magnitude of the predic- 
tion error exceeds a predetermined threshold, the coder transmits a 
quantized version of the prediction error and the necessary addressing 
information to the receiver. 

Above we described the basic motion-compensation algorithm. We 
now describe its modifications relevant to the spatially subsampled 
television signal. Although the algorithm we describe below can be 
applied to signals subsampled in a variety of ways, we restrict ourselves 
to horizontal (along a scan line) subsampling by a factor of two and 
four to one. Modifications and details of each component of the 
motion-compensated system are given below. 


2.1 Displacement estimator 


Figure 1 shows the pel arrangement used for the updating process of 
the displacement estimator. Since the subsampled and subsequently 
interpolated pels contain more noise, they are given less importance in 
the updating process. This is done in two ways. The frame-difference 


signal at subsampled pels is given less weight in determining where 
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Fig. 1—Pel configuration for a displacement-updating process. Displacement is re- 
cursively updated only at those pels where the weighted sum of frame difference in a 
window around pel & is above a threshold. 


the displacement is updated and a smaller updating constant (‘epsilon’) 
is used at subsampled pels. Thus, the estimator works as follows: 


EDIF;, (7) 


Dz = Dp_1 — eSGN| DFD(x», Dp-1) |-SGN LDIF, 








where 


€1, if pel x, is subsampled 
e= . (8) 

€, otherwise . 
We have found that the estimator performance is improved by choos- 
ing €: < € (both positive numbers). The recursion is carried out only 
at those pels where a weighted sum of the magnitude of the frame 
difference in a window around pel k exceeds a given threshold. Thus, 
D,-1 is updated only if 


P 
Y\ w,| FDIF (x:+;)| = THRESH, ;sx (9) 
J=—p 

otherwise D, = D,-1. The weights {w;} are nonnegative and are lower 
for subsampled pels compared with the nonsubsampled pels. The 
threshold, THRESH, is preselected and optimized by simulations, 
and the displacement D,, was used for the prediction of the element at 
location & in the next line (see Fig. 1). 


2.2 Predictor and predictor selection 


As in our earlier works, we have used only predictors based on 
intensities in the previous frame. The previous-frame and displaced- 
previous-frame predictions are calculated for each pel, whether sub- 
sampled or not. Of course, other predictions (e.g., intrafield predictors 
such as previous element or line) can be used to augment our prediction 
strategy. Having computed both the previous-frame and displaced- 
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frame predictors, we use the following rule to switch adaptively be- 
tween them on a pel-by-pel basis. Referring to Fig. 1, we use the frame- 
difference predictor if 


+m 


Y w,| FDIF (xr+;)| S x w,| DFD(x:+;, D,) |; (10) 


j=-m jam 


otherwise we use the displaced-frame predictor. The above inequality 
is evaluated for a window of size (2m + 1) pels centered around pel k. 
Again, less weight is given to pels that are subsampled, i.e., w; is lower 
for subsampled pels. In the calculation of FDIF(-) and DFD(., -), 
sometimes the use of interpolated pels in the previous frame may be 
required. 


2.3 Subsampling and interpolation 


We considered several patterns for subsampling. Some patterns were 
such that they did not change from line to line, field to field, or frame 
to frame, whereas some were intentionally staggered. Some of these 
are described in Section 2.4 Having selected a subsampling pattern, 
we then interpolated the missing pels using an adaptive technique. If 
the magnitude of the prediction error for the nearest nonsubsampled 
pels to the right and left was below a threshold, then the intensity of 
the subsampled pel was replaced by its prediction. If, on the other 
hand, either the closest right or left nonsubsampled neighbor had 
prediction-error magnitude above the threshold, then a simple linear 
(horizontal) interpolation was used to reconstruct the intensity of the 
subsampled pel. The threshold used is the same as the one that 
determines whether the quantization error is transmitted. This type of 
adaptive interpolation improves with the quality of prediction. The 
conditional-replenishment coders subsample the “moving area” pels, 
which are determined by the frame difference signal. This has the 
effect of blurring the entire moving area even if more sophisticated 
predictors are used. Our strategy, on the other hand, replaces all the 
“predictable” pels (as determined by the closest neighbors) by their 
prediction rather than by spatial interpolation. Thus, only the unpre- 
dictable areas are blurred by spatial interpolation. 


2.4 Transmitted information 


As previously mentioned, we transmitted to the receiver the quan- 
tized prediction error of every nonsubsampled pel where magnitude of 
prediction error was above a threshold, called the replenishment 
threshold. This classifies pels into predictable and unpredictable pels. 


mm ath nraanntatairyva lawala at 
A 35-level symmetric quantizer was used with representative levels at 


0, 3, 6, 11, 16, 21, 28, 35, 44, 53, 64, 77, 92, 109, 128, 149, 178, and 197 
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(on an 8-bit scale of 0 to 255). We obtained decision levels by averaging 
the adjacent representative levels. A code set for representing the 
quantizer levels was not designed, but entropies were computed. In 
addition, horizontal run lengths of predictable and unpredictable pels 
were transmitted. Here again, entropies of the predictable- and unpre- 
dictable-pel run lengths were calculated. This assumes that in practice 
separate code sets will be used for run lengths of predictable and 
unpredictable pels. 


lll, SIMULATIONS AND RESULTS 


Computer simulations were performed on two types of scenes. The 
first was a synthetic scene that was computer-generated. It was a 
damped radial cosine in intensity with a radius of 60 pels, which 
translated from frame to frame by a given amount. The pattern is 
described mathematically by the intensity function 


I(R) = 100-exp(—0.01R)cos(27R/P), 0 = 60, 


where R is the radial distance from the center [taken to be (100,100) ] 
and 


P = (1— R/60)10 + 10. 


This function is displayed on a 256- by 256-element raster in two 
interlaced fields of 128 lines each. The pattern is shown as Fig. 6 in 
Ref. 7. The other scene, called Judy, is a head and shoulders view of 
a person engaged in active conversation. This consisted of 50 frames 
obtained by taking a Nyquist-rate sampling of a video signal having a 
1-MHz bandwidth. Each sample was quantized uniformly to 8 bits. 
Four frames of this sequence are shown in Fig. 4 of Ref. 5. 


3.1 Synthetic Scene 


The simulations on the synthetic scene were restricted to 2:1 sub- 
sampling with a subsampling pattern that did not change from line to 
line and field to field (referred to as a nonstaggered pattern). The 
purpose of this simulation was to evaluate the degradation in the 
performance of the displacement estimator. Therefore, only the dis- 
placement was calculated, without using it for coding. Figures 2 and 3 
show the relative displacement error as a function of the iteration 
number. The iteration number in this case refers to only those in- 
stances where the displacement was actually updated. Owing to several 
factors [e.g., the setting of the threshold in eq. (9)], a given iteration 
number in a subsampled case may not be at the same location in the 
nonsampled case. Figure 2 shows the case when the pattern moves at 
4 pels per frame, whereas Fig. 3 shows the results for displacement of 
5 pels per frame. The parameters of the displacement estimator (of 
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Fig. 2—Relative error in displacement for a synthetic moving pattern at 4 pels per 
frame as a function of iteration number. 
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Fig. 3—Relative error in displacement for a synthetic moving pattern at 5 pels per 
frame as a function of iteration number. 
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Fig. 4—Bits required per field for conditional-replenishment scheme using a fine (511- 
level) quantizer. 


Section II) were selected by trial and error. It is seen from both figures 
that the convergence of the displacement estimator is more rapid when 
there is no subsampling. The degradation appears to be somewhat less 
for the 5-pel/frame case as compared with the 4-pel/frame case. The 
effect of this degradation on the motion-compensated coder is evalu- 
ated in Section 3.2. 
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Fig. 5—Bits required per field for motion-compensation scheme using a fine (511- 
level) quantizer. 


3.2 Real Scene 


Before the performance of the subsampled motion-compensated 
coders is given, it is important to note the parameters that were chosen 
for simulations. These choices were based on trial and error. While 
this may not have resulted in optimum choices, our choices may not 
be too far from the optimum. 

The epsilon of eq. (8) was different for subsampled pels compared 
with nonsubsampled pels; €; was 4, and €2 was %2. The parameters in 
the update condition of eq. (9) were taken to be: p = 1, and THRESH, 
= 4 (ona scale of 0 to 255, 8-bits). The weight given to subsampled pels 
was 1 and to nonsubsampled pels was 2. Thus, through e€ and these 
weights, subsampled pels were given less “importance” in the displace- 
ment estimation process. The predictor selection [eq. (10)] was done 
by using a window of 3 (i.e., m = 1), and the subsampled pels were 
given weight w; = 1, whereas nonsubsampled pels were given weight 


1904 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1982 


/ 


7 
CONDITIONAL REPLENISHMENT 
(NO SUBSAMPLING) 


CONDITIONAL REPLENISHMENT 


y (2:1 SUBSAMPLING) 


KILOBITS PER FIELD 


/ 
/ 


CONDITIONAL REPLENISH MENT 
(4:1 SUBSAMPLING) 





0 10 20 30 40 50 60 70 80 90 100 
FIELD NUMBER 


Fig. 6—Bits required per field for conditional-replenishment scheme using a coarse 
35-level quantizer. 


w; = 4. The displacement estimator was not initialized at the beginning 
of each scanning line; thus, the estimate from the last pel of the 
previous line was used as the initial estimate for the first pel of the 
next line. 

The performance was measured by the number of coded bits that 
were required for each field (approximated by appropriate entropies). 
When the coarse quantizer of Section 2.4 was used, then the total 
squared-interpolation error was also calculated for each field. Figures 
4 and 5 show the coded bits for both conditional replenishment and 
motion compensation, when no coarse quantization was performed 
(i.e., quantizer with 511 levels was used). It is obvious that 2:1 subsam- 
pling reduces the bits/field by approximately two for both conditional 
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replenishment and motion compensation. The decrease in bit rates is 
high for those fields with a large amount of motion. The 4:1 subsam- 
pling reduces the bit rates of motion-compensation schemes much 
more significantly compared with conditional replenishment. However, 
this may be a peculiarity of the particular scene we used for simulation. 
We used a few more scenes and found that 4:1 subsampling, in general, 
reduced the rate by a factor of two compared with 2:1 subsampling for 
both conditional replenishment and motion compensation. 

Using the 35-level quantizer mentioned earlier, the bit rates are 
plotted in Figs. 6 and 7. Figure 6 shows conditional replenishment and 
Fig. 7 shows motion compensation. These figures indicate that motion 
compensation results in approximately 60-percent reduction for no 
subsampling, approximately 80-percent reduction for 2:1 subsampling, 
and about 90-percent reduction for 4:1 subsampling, compared with 
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Fig. 7—Bits required per field for motion-compensation scheme using a coarse 35- 
level quantizer. 
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Fig. 8—Plot of squared-interpolation error summed over entire field versus field 
number of 2:1 spatial subsampling. Adaptive interpolation is used in the case of motion 
compensation. Fixed-linear one-dimensional spatial interpolation is used for conditional 
replenishment. 


conditional replenishment. Obviously these conclusions are scene-de- 
pendent. However, for scenes containing significant translational mo- 
tion, such conclusions may remain valid. It is always difficult to 
evaluate quality of short segments of scenes. We made informal 
observations to compare pictures resulting from conditional replenish- 
ment and motion compensation. Subsampling results in visible blur- 
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ring. However, it was found that, owing to our adaptive interpolation 
scheme, motion compensation blurred a much smaller area than did 
conditional replenishment. Also, the blurred areas in motion compen- 
sation appear to be much more fragmented and somewhat randomly 
distributed, which also decreases their visibility. Figures 8 and 9 show 
the plots of squared-interpolation error per field versus the field 
number for both 2:1 and 4:1 subsampling. Curves for both frame- 
difference conditional replenishment and motion compensation are 
shown. In the case of 2:1 subsampling, the interpolation error decreases 
by almost a factor of two using motion compensation. This decrease is 
even greater for 4:1 subsampling. 

In our simulations we also tried staggering the subsampling pattern 
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Fig. 9—Plot of squared interpolation error summed over an entire field versus field 
number for a 4:1 spatial subsampling. 
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from line to line, field to field, and frame to frame. This makes the 
quantization and the interpolation noise appear random and less 
patterned. However, field-to-field or frame-to-frame staggering results 
in annoying flicker, as expected. We thus found line-to-line staggering 
to be most useful. Although staggering improved the quality of pictures 
for both conditional replenishment and motion compensation, the 
improvement was somewhat higher for conditional replenishment. The 
required bits per field did not change significantly because of stagger- 
ing. Thus, we might conclude that line-to-line staggered-subsampling 
patterns improve the quality of pictures without any significant in- 
crease in the bit rates. 


IV. SUMMARY AND CONCLUSIONS 


We have presented in this paper schemes for motion compensation 
in the presence of spatial subsampling, which is required in interframe 
coders to prevent buffer overflow.We also described an adaptive inter- 
polation scheme that blurred only the “unpredictable” area during 
subsampling. Computer simulations were performed in synthetic 
scenes to evaluate degradation of the displacement estimator in the 
presence of subsampling. It was found that, although the quality of 
displacement estimation was degraded in the presence of spatial sub- 
sampling, the effect on the bit rates was not significant. Compared 
with conditional replenishment, motion compensation reduced the bit 
rates by a factor of two or more even during subsampling. Thus, a 2:1 
subsampled motion-compensated coder results in about one quarter of 
the bit rate of conditional replenishment and about one half of the bit 
rate of the 2:1 subsampled conditional-replenishment coder. Our adap- 
tive interpolation scheme blurs only the “unpredictable area” rather 
than the “moving area’ that is blurred in subsampled conditional- 
replenishment coders. Since “unpredictable area” is a subset of moving 
area and is fragmented randomly, the blurring caused by subsampling 
in the case of a motion-compensated coder is much less visible. This 
is also borne out by the total-interpolation error, which decreases by 
more than a factor of two using adaptive interpolation. 
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The Detection of Long Error Bursts During 
Transmission of Video Signals 


By K. JANAC and N. J. A. SLOANE 


(Manuscript received January 4, 1982) 


In this paper we describe a simple threshold code for monitoring 
digital communication channels which, with high probability, will 
detect the presence of an error burst of length at least 70, say, but will 
ignore shorter bursts. This code has applications to digital systems, 
such as video-conferencing systems, which have an existing error- 
correction procedure that can handle small numbers of random 
errors and bursts of length less than some fixed number, but which 
are adversely affected by longer bursts. 


1. INTRODUCTION 


Sophisticated techniques are available for removing redundancy in 
TV pictures.’” These reduce the cost of transmitting video signals over 
telephone lines, as, for example, in video-conferencing systems.** The 
compressed video signal is represented in digital form and is protected 
by coding’ against the most frequently occurring channel errors.’ 
Certain other, less common, error patterns, however, are also detri- 
mental to the system, causing the picture to disappear momentarily, 
a phenomenon that is annoying to the viewer. In this paper we describe 
a coding scheme that detects a class of error patterns that contributes 
to this degradation of the system. The scheme can also be used for 
other digital transmission systems where similar coding procedures are 
employed. 


Il. MODEL OF A DIGITAL VIDEO-COMMUNICATION SYSTEM 


Figure 1 shows a simplified model of a digital video-communication 
system, as used, for example, in video-conferencing services.'* This 
model emphasizes the error-correcting mechanism. Video data (a) 
from the camera is first passed through a picture coder or data 
compressor (b) that removes some of the redundancy from the picture. 


1911 









PICTURE 
CODING 


MULTI- 
PLEXER 






2 
} CHANNELS 





LEAVER 






(a) (b) (d) {e) 


Fig. 1—Error-correcting mechanism in a typical video communication system. 


The resulting bit stream is encoded (c) by an error-correcting code, a 
typical code being the [255, 239, 5] double-error-correcting Bose-Chau- 
dhuri-Hocquenghem (BcH) code.® The output from the encoder is 
interleaved (d), say, to a depth of 35. (This is explained in greater 
detail below.) Finally, a multiplexer (e) combines the video data from 
the interleaver with audio and control information, and distributes the 
result among two high-speed (typically 1.544 Mb/s) channels for 
transmission. At the receiving end the inverse operations are per- 
formed. The audio and control data have a much lower rate than the 
video data, and can usually be ignored in the analysis of errors. (Other 
systems use one or even four transmission channels, but it is not 
difficult to show that our proposed coding scheme applies equally well 
to these situations.) 

To understand how errors on one of the channels affect the picture, 
we must consider how the interleaver and multiplexer together distrib- 
ute the bits among the channels. This is explained in Fig. 2. The 
interleaver consists of an array (b;) of 35 x 255 bits (Fig. 2a). Code 
words from the BCH encoder enter the array by rows and are read out 
by columns. Figure 2b shows the same bits distributed among the two 
channels. We see that 35 bits now separate any pair of adjacent bits in 
a code word. 

Thus an error burst* of length <= 34 on a channel will affect at most 
one bit from a code word, a burst of length 35 to 69 at most two bits, 
a burst of length 70 to 104 at most three bits, and so on. If not more 
than two bits in a code word are in error, this will be corrected by the 
BCH code. Thus error bursts of length <= 69 on any channel are not 
significant. On the other hand, bursts of length => 70 may cause three 
or more errors in a code word, and this cannot be corrected by the BCH 


* Definition of puns Suppose x), --- , X, is transmitted and yi, +++, Yn is received. 
The difference Qi, +++ Cn = Yi — Xi, Yo — Xe, +++, Yn — Xp is the error pattern. If the 
vector é1, «++, €n is 0 except for a string of length b, which begins and ends with a 1, we 


es fe a burst of length 6 has occurred. For example, 00110101000 contains a burst of 
engt 
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Fig. 2—The interleaver and multiplexer distributing bits among channels. (a) 35 BCH 
code words are loaded by rows into the array in the interleaver. (b) The same bits after 
they have been read out of the array by columns and multiplexed between the two 
channels. Bits from the first row of the array are marked with circles. 





code. However, in most cases, the presence of such an error pattern 
will be “detected,” either by the BCH code or by the parity checks 
associated with the picture coder. When this happens, the receiver 
normally substitutes the appropriate portion of the previous frame for 
the missing segment. But if too many errors of this type are detected 
in one frame, the whole frame must be retransmitted, instead of, for 
example, just sending the difference from the previous frame. This 
leaves the screen momentarily blank, which is disturbing to the viewer 
and is what we are trying to avoid. The penalty for a burst of length 
greatly exceeding 70 is much higher, for more rows of the array will be 
affected. 

The problem then is to detect whether either channel has suffered 
from an error burst of length greater than 70. When such a channel 
has been identified, the appropriate corrective action can be taken 
(possibly including switching the transmission to a backup channel if 
the long bursts occur frequently). 

The problem is somewhat unusual, since most existing burst detec- 
tion schemes (compare with Ref. 7) are designed to detect the presence 
of any burst of length up to some number b;, while here we are only 
concerned with detecting bursts of length greater than by. 


Il. BURST DETECTING CODE 


The code we propose is a simple threshold scheme, to be used 
separately on each channel. There are many possible variations of this 
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code, but we shall just describe the simplest version. The data is 
usually transmitted down each channel in blocks of some fixed length, 
say 4200. We wish to detect the presence of an error burst of length 
> 70, occurring either inside a block or overlapping two adjacent 
blocks. 


3.1 Encoding 


Our encoding procedure takes the last 70 bits of the previous block 
and the 4200 bits of the current block, a total of 4270 = 70 x 61 bits, 
which we denote by 


Uj, U2, +*° , U4270, 


and forms the following check sums: 


V1 = Uy + U7 + U4, + eee) + 4201, 


V2 = U2 + U72 + Ur4g + +++ + U4202, 


U790 = U7 + U140 + U2i0 + +++ + U4270, 


the addition being carried out modulo 2. In other words, the 4270 bits 
are divided into 61 subblocks of length 70 and the modulo 2 sum v = 
(U1, +++, Uz) of these subblocks is determined, without carries. This 
check vector v is then transmitted along with the current block. (Of 
course, parity checks have been used since the beginning of coding 
theory, but as far as we know, their use in this particular configuration 
is new.) 


3.2 Decoding 


When the bit stream on this channel reaches the receiver, the check 
sums are recomputed from the same 4270 bits, producing a vector w 
= (Wi, +++ , Wi), say. The decoder determines the number WN of places 
where w differs from the received version of v. Of course, if there are 
no errors, then w = v and N = 0. The decoder compares N with a 
fixed threshold 6. If N < 6, the decoder decides that the channel is 
acceptable (and there is only a low probability that a burst of length 
> 70 has occurred). If N > 0, the decoder decides that there is a 
significant probability that a burst of length = 70 has occurred, and 
requests that the appropriate corrective action be taken. The basic 
configuration of the scheme is illustrated on one channel by the block 
diagram shown in Fig. 3. A similar arrangement is realized in all four 
channels (two in each direction). The implementation of this scheme 
clearly presents no difficulties. 
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Fig. 3—Simplified block diagram of the proposed scheme for one channel showing 
the mth data block w”. 


3.3 Choice of Threshold 


The following argument shows that an initial threshold of @ = 22 is 
a good, conservative choice. (However, @ can be left as a variable 
parameter in the system.) Let us assume that a random burst of errors 
of length b will begin at any point in the bit stream, and will comple- 
ment each of the following 6 bits with probability 1/2. On the average, 
b/2 bits will be wrong. If 5 is less than 70, each of the errors will affect 
just one check bit, and so there will be about b/2 places where vu; ¥ w;, 
implying N ~ 6/2. On the other hand, if b = 70, all the check bits will 
be involved, and we can expect N =~ 35. However, these are just the 
average values of N, and the actual values will be different. The 
variance o” of N is roughly b/4, which is 17.5 when b = 70. Therefore, 
an initial threshold of 6 = 35 — 30 = 22 is recommended. The optimal 
value can then be determined by experiment. A more precise deter- 
mination of the threshold cannot be made at this time owing to the 
uncertainty about the density of 1’s inside a burst and of the distri- 
bution of burst lengths. : 

The problem is essentially one of hypothesis testing, of distinguish- 
ing between the “null hypothesis” that the channel is acceptable and 
the hypothesis that a burst of length = 70 has occurred. Let us assume 
that the burst lengths have an exponential distribution, i.e., that a 
burst of length b = 0, 1, 2, --- occurs with probability 
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where A is a parameter and c, = 1 — e~*. Then two important quantities 
associated with our code are the “false alarm” probability, which is 


a = Prob(N > @|channel is good) 


69 i b b a 
= ce —, 1 
2 : net & 2° (1) 


and the probability of an undetected burst, which is 
B = Prob(N S 6| channel is bad) 


ge ms (ae 

dy ed Ps (7) oe (2) 
If \ is known, these two probabilities can be calculated easily from eqs. 
(1) and (2) and the best value of @ chosen accordingly. Presumably, 6 
should be made much smaller than a, but this decision must be made 
by the system designer. 

Some possible variations on this scheme follow: (i) v could be sent 
over a separate, slower channel. (ii) The number of check bits, i.e., the 
length of v, could be made greater than 70, thus supplying more 
information about the state of the channel. [Equations (1) and (2) then 
need to be slightly modified.] (iii) The decoder could use two thresh- 
olds, and declare the channel good if N < 6;, bad if N > 62, and 
questionable if 4; = N = @2. 


IV. SUMMARY 


The existing error-detecting procedures for digital communication 
channels do not distinguish long error bursts from other types of 
errors, even though in the case of compressed video signals these long 
bursts contribute to fast retransmissions of the whole frame, leaving 
the screen momentarily blank, which is annoying to the viewer. We 
have proposed a simple threshold code that is compatible with the 
present video-compression and communication-channels systems, and 
will detect such long bursts with high probability. Formulas are given 
for the probability of an undetected burst and of a false alarm. The 
code can be used for monitoring the burstiness of digital communica- 
tion channels for other services. 
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Considerations for Single-Mode Fiber Systems 


By K. OGAWA 
(Manuscript received January 12, 1982) 


The intrinsic low-dispersion and low-attenuation properties of 
single-mode fibers between 1.3 and 1.6 um make them attractive for 
use in high-capacity, long-haul digital systems. In this paper we 
discuss some fundamental performance limitations—such as atten- 
uation, bandwidth, mode-partition noise, burst-type error, and re- 
ceiver sensitivities—for systems with bit rates above 274 Mb/s. Also, 
we discuss the maximum capacity achievable by either using a single 
channel at the minimum-dispersion wavelength, or multiple wave- 
length-multiplexed channels with equal, but necessarily lower, bit 
rates. We conclude that the characteristics of present laser diodes 
limit repeater spacing to lengths far less than the potential capacity 
expected from single-mode fibers. For total capacity of bit rates less 
than 1 Gb/s, wavelength multiplexing is found to offer longer repeater 
spacings than single-wavelength systems. 


|. INTRODUCTION 


The intrinsic low-dispersion property of single-mode fibers makes 
them attractive for high-capacity, long-haul lightwave systems,’” es- 
pecially in the wavelength region between 1.1 and 1.7 ym, where low 
attenuation (less than 1 dB) has been demonstrated.*? Minimum dis- 
persion can be achieved over a range of wavelengths above 1.3 ym by 
controlling Ge-doping density and core size.*® Low dispersion over a 
wide range of wavelengths can be achieved by introducing a certain 
amount of waveguide dispersion.”® As in any communication system, 
there are three basic origins of system-performance limitations and 
degradations: (i) sources, (iz) media, and (iii) receivers. Some limita- 
tions, such as receiver sensitivity and source-output power, are deter- 
mined by a single component. However, many other performance 
limitations and degradations are caused by the interaction of two 
components. 

Laser diodes as sources have several characteristics that result in 
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Table I—Degradation factors of laser diodes 

















Characteristics Related Fiber Degradation Ef- 
of Laser Characteristics fects to System 
Center wave- Manufacturing varia- Bandwidth (with 
length tion +3 nm fiber disper- 
sion) 
Temperature variation Bandwidth (with 
0.5 nm/°C fiber disper- 
sion) 
Mode skipping Burst error 
(jumps) 6 ~ 10 AA 
Chromatic 
dispersion 
Spectrum (half- Half rms width 2 ~ 4 Bandwidth of fi- 
rms width) nm ber 
Spectrum broadening 
Mode partition (0Sk Mode-partition 
S1) noise 
Intrinsic noise Quantum noise N.A. and A (cou- Noise 
Transverse mode fluc- pling to fiber) Power fluctuation 
tuation 
Reflection Spectrum noise Fiber end Noise 
Self-pulsing Pigtail length 
Bandwidth 1 ~ 2 GHz Bandwidth of 


transmitter 


severe limitations on the performance of a single-mode fiber system. 
As indicated in Table I, these characteristics, such as spectral distri- 
bution,® longitudinal mode-partition effects,*"' and abrupt jumps’”” 
of the longitudinal or transverse lasing modes,’® cause a degree of 
system degradation that depends on the characteristics of the fiber 
(chromatic dispersion). There are also fundamental performance limi- 
tations and degradation factors for single-mode fiber systems caused 
by fiber attenuation and bandwidth,'** mode-partition noise,”'*”” and 
limited receiver sensitivity.’° 

Because of these limitations and their mutual interactions, it is a 
problem when designing a lightwave system with a given bit rate per 
fiber to decide whether to use a single high-speed digital signal or 
several lower-speed channels multiplexed on different wavelength 
lightwaves. This paper will discuss this problem and develop some 
general guidelines for solving it. 


ll. PERFORMANCE LIMITATIONS AND DEGRADATIONS 


Table I shows detrimental aspects of laser diodes and Table II shows 
degradation factors caused by fibers. In this paper, we will focus on 
five fundamental factors that limit system performance: (z) fiber 
attenuation, (iz) fiber bandwidth, (ziz) mode-partition noise, (iv) burst- 
type errors caused by mode jumps, and (v) receiver sensitivity. 
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Table Il—Characteristics of fiber 
Loss 


Intrinsic loss ® Infrared absorption 
Ultraviolet absorption @ Wavelength dependency 
Rayleigh scattering 

















OH-absorption @ 1.24 um 
@ 1.38 um 

Imperfection of wave- © Boundary of core and Refractive index 
guide cladding 

@ Bubbles Difference dependency 

® Core size variation 
Microbending @ Coating © Core size and refractive index 

difference 

® Stress @ Wavelength dependency 

Splicing loss 
Dispersion 

Material dispersion ® Doping material @ Wavelength dependency 





Waveguide dispersion © Core size and refractive @ Wavelength dependency 
index difference 


ATTENUATION IN DECIBELS PER KILOMETER 





WAVELENGTH IN MICROMETERS 


Fig. 1—Fiber attenuation spectrum of a Ge-doped core single-mode fiber. 


2.1 Fiber attenuation 


Figure 1 shows the theoretical attenuation of single-mode fiber (Ge- 
doped core) caused by Rayleigh scattering, infrared absorption, and 
ultraviolet absorption. The crosshatched area in Fig. 1 shows the 
theoretical limitation (refractive index difference A = 0.2 to 0.7 percent) 
without the water absorption peaks at 1.38 and 1.24 um. The water 
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absorption peaks at 1.24 and 1.38 um are expected to be eliminated 
eventually by improvement of the dehydration process. There are 
additional attenuation factors such as (i) imperfection losses of the 
waveguide,” (ii) microbending losses,”’ and (iii) splicing losses.”)” 
Considering these additional losses, probably 0.5 dB/km total fiber 
loss at 1.3 xm is the theoretical limit, and 1 dB/km can be assumed to 
be obtainable for practical fiber systems. 


2.2 Bandwidth, mode-partition noise, and burst error by mode jumps 


Bandwidth, mode-partition noise, and burst error caused by mode 
jumps are all generic functions of laser and fiber characteristics. If 
laser diodes are stabilized someday either by injection locking,” a new 
laser-cavity configuration,“ or other methods (such as the use of an 
external modulator),” then these will not limit system performance so 
severely. However, at present these three factors are important con- 
siderations for single-mode systems. 


2.2.1 Bandwidth limitations 


The bandwidth of a system using a well-stabilized single-longitudi- 
nal-mode laser would be limited by the degenerate-polarized mode 
dispersion of the fiber,” not by the laser spectrum. Since most laser 
diodes show spectral broadening under direct-current modulation, the 
bandwidth is determined by the source spectrum and the chromatic 
dispersion of the fiber. Figure 2 shows the bandwidth of single-mode 
fibers. Two curves are shown for source rms half-widths of 1 nm and 
2 nm. The wavelength of minimum dispersion is assumed to be 1.3 pm. 
The overall transmission bandwidth is determined by chromatic dis- 
persion of the fiber and by the laser spectrum, as follows:'°" 


187.3 








feas = —————[GHz] off the wavelength of 
Oe dt minimum dispersion 
dx 
616.4 
feas = ——————[GHz] at the wavelength of (1) 
d? 
Figo minimum dispersion, 
dv? 








where feas[GHz] is the bandwidth of fiber (3-dB optical bandwidth), Z 
re d : ‘ ; ; 
is its length, | a [ps/nm-km] is the differential chromatic dispersion 


2 
Fe [ps/nm’?-km] is the third-order chromatic dispersion, and o is 
the half-rms width of laser spectrum. 
If we permit a 3-dB power penalty at an error rate of 10-°, then the 
signal bit rate, B(b/s), and fiber bandwidth, fean, are related by” 
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Fig. 2—Bandwidth of a single-mode fiber vs. wavelength. 


feas = 0.55B. (2) 


Combining equations (1) and (2), we obtain the performance limitation 
caused by bandwidth as a bit rate-distance product, B-Z, as shown 
below. 





B-Zs seen [Gb/s-km] off the wavelength of 
i dt | minimum dispersion 
dn 
B-Zs — [Gb/s-km] at the wavelength of (3) 
a: dt minimum dispersion. 
dd” 








2.2.2 Mode-partition noise 


As discussed in Ref. 10, mode-partition noise can become a serious 
performance limitation. The bit-rate-distance product caused by this 
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partition noise is expressed for the condition that the asymptotic error 
rate be 107!” (or a power penalty or 1.5 dB at 10~° error rate) 








B-Z=E pe Ch ial off the wavelength of 
6 Or afk minimum dispersion 
1173 
B.Z =——_.—_|Gb/s- km] at the wavelength of (4) 
2 ao di ; 
Oo. >? Vk minimum dispersion, 








where k is the mode-partition noise-suppression factor (0 S k S 1). 
Measured results indicate that k lies between 0.4 and 0.7 and a 
theoretical analysis of k shows that it cannot be zero and is about 0.1 
even with dc operation. A comparison of eqs. (3) and (4) shows that 
the effect of partition noise is greater than that of bandwidth when k 
is greater than 0.1. Therefore, mode-partition noise is the dominant 
effect on the system. Figure 3 shows the bit-rate-distance product 
caused by the mode-partition noise in the worst case, which is k = 1. 
Two curves are shown for source rms half-widths of 1 nm and 2 nm. 
The wavelength of minimum dispersion is assumed to be 1.3 ym. 


2.2.3 Burst error caused by mode jumps 


Because of temperature variations or drive-current variations, some 
laser diodes show sudden jumps of the center wavelength. This causes 
a burst of errors until the retiming circuit locks into the shifted phase. 
This burst-type error will happen randomly at any repeater. To keep 
the system error rate within system requirements, there are two design 
approaches: 

(t) Error-rate objective—worst short-term error rate must be under 
10° when this burst error is a frequent random phenomenon. 

(ti) Error-free-seconds objective—the error-free seconds must be at 
least 95 percent when burst errors occur occasionally. 

If we assume that mode jumps at each repeater are rather frequent 
random phenomena, unlike fading or lightning, then a 6400-km ter- 
minal-to-terminal error-rate objective should be applied to this burst- 
type error caused by mode jumps. From Ref. 8, the bit-rate-distance 
product in the worst case is then given by the relation 








225 
B-Zs ae oe [Gb/s-km], 
T 
—|-AA 
r 
where AAInm] is the center-wavelength shift for a typical jump. 
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Fig. 3—Bit-rate-distance product caused by mode-partition noise vs. wavelength. 


2.3 Receiver sensitivity 


There are two basic design approaches used to obtain highly sensi- 
tive receivers: (t) InGaAs-pin photodiode’ with an ultra-low noise 
amplifier, and (ii) Ge” or InGaAs avalanche photodiode” with a low- 
noise amplifier. For the first approach, there are several device tech- 
nologies that provide ultra-low noise amplifiers above 274 Mb/s: (z) Si- 
microwave bipolar transistors,*' (ii) GaAs-MESFETs,” and (iii) Si 
short-channel MosFETs.’®*’ Although Si bipolar transistors will deliver 
better noise characteristics than FETs above 300 Mb/s, theoretically 
FETs are still useful for high bit-rate amplifiers because the cut-off 
frequency (~4 GHz) of practical bipolar transistors is lower than that 
of FETs (10 ~ 25 GHz). An avalanche photodiode APD receiver delivers 
much higher sensitivities, theoretically, even with its intrinsic excess 
noise factor. However, the sensitivity achieved using a practical 
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GeAPD is comparable to InGaAs pin with a ultra-low noise amplifier. 
In general, the practical value of sensitivity can be expressed as 


P = —34 + 10 log B[dBm], 
where B is [Gb/s]. 


Il. REPEATER SPAN 


The achievable repeater spacing at different bit rates above 274 
Mb/s can now be found assuming 0 dBm of average output power from 
laser diodes into the fiber. Figures 4 and 5 show two different cases: 
Fig. 4 shows that the half-rms width of laser spectrum is 2 nm and 
mode jumping Ad is less than 3 nm. Solid lines indicate performance 
limitations due to fiber attenuation (0.5 dB/km and 1.0 dB/km). The 
dot-dash lines are the fiber bandwidth of 1.3 wm and 1.275 pm. 
However, the mode-partition noise limitations indicated by the dashed 
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Fig. 4—Theoretical limits of repeater span due to the mode-partition noise limitation. 
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Fig. 5—Theoretical limits of repeater span due to the burst-error limitation. 


lines are much more severe than the fiber bandwidth limitations. Also, 
when the center wavelength shift caused by a mode jump is 2.9 nm, 
the burst-error limitation is the same as the mode-partition noise 
limitation for 1.275 um. Figure 5 shows that the half-rms width of laser 
spectrum is 2 nm and mode jumping Ad is 5 to 10 nm. When the center 
wavelength shift due to a mode jump becomes large (5 nm to 10 nm), 
the system performance is limited by the burst errors caused by mode 
jumps. Two dotted lines indicate the performance limitation caused 
by mode jumps. For both figures, two solid lines indicate the fiber 
attenuation limitations; one is 0.5 dB/km and the other is 1 dB/km. 
The dashed lines indicate the performance limitations caused by the 
mode-partition noise when the mode-partition factor, k, is 1; the dotted 


SINGLE-MODE FIBERS 1927 


line is the performance limitation of the burst-errors caused by mode 
jumps. The chain line indicates the bandwidth limitations caused by 
laser spectral width. As we see in Figs. 4 and 5, the fiber bandwidth 
limitation is no longer dominant in comparison with mode-partition 
noise or mode-jump penalties. The difference between Figs. 4 and 5 is 
due to the effect of mode jumps. This shows that when we use laser 
diodes with narrow spectra, the mode jumps become the dominant 
effect. 


IV. MAXIMUM TRANSMISSION CAPACITY 


Laser diodes have about 1- to 2-GHz modulation bandwidth. The 
following criteria are used to maximize transmission capacity by either 
increasing signal speed or using wavelength multiplexing. 

(it) Total data rates are to be n X 274 Mb/s (n = 1, 2, 4). 

(ti) For wavelength multiplexing, the separation of wavelengths is 
25 nm. Since InGaAsP/InP laser diodes have a wavelength tempera- 
ture dependency of 0.5 nm/°C,” we assume stabilization of the tem- 
perature is within +10°C. We also assume that the initial variation of 
laser wavelength is within +2.5 nm, and that the separation band 
required to suppress the interference from a neighboring channel is 10 
nm. Therefore, a total value of 25 nm becomes the separation of 
neighboring wavelengths. 

(iit) The transmission distance and bit rate for each wavelength- 
multiplexed channel is the same. 

(tv) The laser spectrum half-width is 2 nm and mode jumps are less 
than 3 nm. 

(v) We ignore the insertion losses of the wavelength multiplexing 
and demultiplexing devices. 

(vi) We also ignore the mode-partition noise caused by the optical 
filter of wavelength multiplexer, which must be carefully designed to 
minimize that noise. 

Figure 6 shows the total capacity and repeater spacing. There are 
two cases: one for 0.5 dB/km, and the other for 1-dB/km fiber loss. 
The solid line is for a basic transmission rate of 274 Mb/s. The small 
numbers on each dot are the orders of multiplexing. The dashed lines 
are the cases of 548 Mb/s. The chain lines are the cases of 1096 Mb/s. 
Each number (1 - - - 5) indicates the number of wavelength multiplexed 
channels. There are two dominant performance limitations: (z) loss 
limitation, and (iz) mode-partition noise limitation. In loss limitation, 
the wavelength multiplexing technique is advantageous to increase its 
capacity. In the mode-partition noise limitation, increasing the bit-rate 
of single channel is a more powerful way to increase capacity and 
repeater spacing. 

Once fiber attenuation and repeater spacing are given, what bit rate 
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Fig. 6—Maximum capacity and repeater span. 


is the best choice? Let us choose a fiber attenuation of 0.5 dB/km and 
the repeater spacing of about 60 km. A 274 Mb/s-multiplexed system 
can achieve its maximum capacity of 3 xX 274 Mb/s with three- 
wavelength multiplexing. The 548 Mb/s system and the 1096 Mb/s 
system can achieve maximum capacities of 1096 Mb/s and 2192 Mb/s, 
respectively; further wavelength multiplexing reduces the repeater 
spacing to a much lower span than 60 km. Thus, the maximum capacity 
can be achieved by 1096 Mb/s systems using two-wavelength multi- 
plexing. The choice of bit rate, 274 Mb/s or 1096 Mb/s, will be 
determined by availabilities of components and transmission capacity 
demands. 


V. CONCLUSION 


We have discussed performance limitations for single-mode fiber 
systems and introduced formulas for several performance limitations. 
Also, in Fig. 6, we have shown that wavelength multiplexing increases 
transmission capacity. We conclude that 

(t) For more than 1 Gb/s capacity, system performance is limited 
by fiber dispersion, especially mode-partition noise. In this case a 
single high-bit-rate channel is the better choice to achieve the maxi- 
mum capacity and longest repeater spacing. 

(ti) For less than 1 Gb/s capacity, wavelength multiplexing be- 
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comes the most effective technique to increase total capacity, to 
achieve longer repeater spacing, and to lower the bit rate for each 
channel. The lower channel bit rate eases component requirements. 

(iit) Mode-partition noise and burst errors caused by mode jumps 
severely limit performance. 

Results of the bit-rate-distance product we discussed here are far 
less than the theoretical 400 to 3000 Gb/s-km that can be expected 
from single-mode fibers. To achieve such a high-capacity single-mode 
fiber system, further research effort is required, especially in the areas 
of: (z) fiber and cabling improvements, (ii) fabrication, stabilization, 
and mode behavior of laser diodes, (iii) high-sensitivity receiver design 
using low-noise components, (iv) a high-gain detector such as InGaAs- 
APD or InGaAs-photo-FETs, and (uv) high-speed integrable compo- 
nents such as GaAs-MESFETs, and Si-MOSFETs. 
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Error Probability of Partial-Response 
Continuous-Phase Modulation with Coherent 
MSK-Type Receiver, Diversity, and Slow 
Rayleigh Fading in Gaussian Noise 
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(Manuscript received January 21, 1982) 


This paper considers a class of constant-amplitude modulation 
schemes with good spectral main lobe and tail behavior. Detection is 
assumed to be coherent and the receiver is of offset-quadrature type, 
L.e., minimum shift keying (MSK) type, consisting of a linear filter in 
each quadrature arm followed by simple processing. Analytical error- 
probability formulas are derived for various modulation schemes and 
receiver filters for ideal diversity with maximal-ratio and selection 
combining. Independent slow Rayleigh fading in Gaussian noise is 
assumed and several numerical examples are given. Asymptotic 
behavior of the error probability for large signal-to-noise ratios is 
derived, and the relationship between the degree of smoothing in the 
partial-response continuous-phase modulation and the asymptotic 
error probability is shown for fading channels with and without 
diversity. 


1. INTRODUCTION 


The transmission of information over radio channels with multiple 
changing-propagation paths is subject to fading, i.e., random time 
variations of the receiver signal strength. For digital transmission over 
a fading channel, the time variations cause a varying error probability 
for all types of digital-modulation methods. 

The application of bandwidth-efficient constant-amplitude modula- 
tion schemes to digital land-mobile radio has been considered in 
several papers.’ Other investigations have shown that transmission 
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over such channels is subject to fading.®” In this paper we will consider 
slow Rayleigh fading where the density function for the signal-to-noise 
ratio (s/n) is 


‘Ne = enV, (1) 


where I is the average signal-to-noise ratio. We define slow fading as 
the time-varying s/n y, which is approximately constant over several 
transmitted bits (symbols). It will be seen below that the type of 
detectors considered for the modulation schemes in this paper operate 
over at most that number of symbols. 

By combining a number of channels with independent Rayleigh 
fading, the density function for the resulting signal-to-noise ratio (1) 
can be improved. This is called diversity. Thus, with decreased prob- 
ability of very low signal-to-noise ratios, the average error probability 
is improved by means of diversity. 

This paper presents analytical, easy to use, bit-error probability 
formulas for smoothed continuous-phase constant-amplitude modula- 
tion with a simple coherent receiver and diversity. From these calcu- 
lations we conclude that the increase in error probability owing to the 
degree of smoothing is smaller for a fading channel than for a nonfading 
channel. In Section IV we show an example of the difference between 
quadriphase shift keying (QPSK) and the schemes 3RAC and 4RC, which 
depict smoothed constant-amplitude modulations with narrow spectral 
main lobe and low spectral tails. The nonfading channel is approached 
with large numbers of diversity branches. The fading channel is 
approached with few branches of diversity. The gradual change from 
the two extreme cases is readily apparent. 


1.1. Probability of error 


Coherent transmission is considered. For quadriphase shift keying, 
4-PSK (QPSK), there is a special case of the more general error-proba- 
bility formula considered below. For QPSK and Binary Phase Shift 
Keying, 2-PSK (BPSK) the error probability for the Gaussian channel 
with ideal transmission and optimal detection is 


P(y) = Q(v2y), (2) 


where y = E/N, is the s/n (per information bit), Ey is the energy per 
information bit, and N, is the (one-sided) spectral density for the 
additive white Gaussian noise. The function Q(-) is the error function 
associated with the normal distribution, i.e., 
1 co 
Ox) =—— (at, (3) 


Van J, 
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For the family of modulation schemes considered in this paper, the 
error probability P(y) for the Gaussian channel is a linear sum of Q- 
function terms. We obtain the average bit-error probability for the 
fading case by averaging P(y) over the density function in eq. (1), ie., 


p={ f(y)P(y)dy. (4) 


The averaging is performed with different density functions for the 
diversity cases below with different combiner strategies. The problem 
dealt with in this paper is finding the average bit-error probability (4) 
for the family of modulation schemes and diversity cases described 
briefly below. 

The solution to the error-probability averaging in (4) is of interest 
in cellular systems with frequency reuse where cochannel interference 
is the main interference source and where diversity is used to improve 
the signal-to-noise density function.**’ When we consider so-called 
time-division retransmission schemes, it is realistic to consider space 
diversity with more than two diversity branches.*° 


1.2 Modulation schemes 


Before we address the fading and diversity problems, we will give a 
short description of the modulation schemes considered in this paper. 
A family of binary constant-amplitude digital modulation schemes is 
defined by the transmitted signal 


S(t, a) = | cosl2nfot + o(¢, a)) (5) 


where EF = E;, is the symbol (bit-) energy, T' is the symbol time, and f, 
is the carrier frequency. The information-carrying phase is 


o(t,a)=27h } aig(t — iT), (6) 
where @ = +++ Qn-2, Qn-1, On; An+1, An+2... IS a Sequence of independent 
binary symbols a; € {—1, +1} (we will consider only binary schemes 
here) and h is the modulation index. The phase response is defined by 


q(t) = [ &(7)dr, (7) 


where g(t) is a time-limited pulse defining instantaneous frequency.’*"* 


The above family of schemes is considered in this paper only for 
modulation index h = 1/2. For this case, such modulation schemes as 
minimum shift keying (Msk)—or fast frequency shift keying (FFSK),' 
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tamed-frequency modulation (TFM),” and Gaussian MSK (GMSK)**—are 
contained in the family (5). Different modulation schemes are obtained 
by changing the pulse shape g(t). Thus, for MSK 


0 t<Q, t>T 


g(t) = a | (8) 
oT 0OstsT, 


and for TFM”?” 


g(t) = “g(t — T) + 2g.(t) + g(t + T)], (9) 


sin (=) 
1 T qm 


Bolt) = 7 TA 24 
(7) 
Osi at _ at at a nt\” . at 
sin a Pp cos 7 a sin 7 
————— ae 7 
T 


The TFM pulse is infinite in time. Below we consider time-truncated 
versions of the pulse (9). 

Raised-cosine pulses of various lengths LT'-” are also considered. 
For this case 


1 2Qrt 
— 1- —_ <t=L 
g(t) = 4 2LT | — (=)] . a 


0 otherwise. 


where 


(10) 


(11) 


The notation LRC denotes a raised-cosine pulse of length LT or a 
modulation scheme (5), (6) based on the pulse LRC, shown in eq. (11). 
Additional details on schemes based on LRC can be found in Refs. 11, 
12, and 13. 

Duobinary MSK, i.e., 


1 
WT 0=<=t=<=2T 
g(t) = (12) 
0 otherwise 


is also considered. This pulse is also denoted as 2REC.” 
Schemes based on pulses g(¢) of length larger than the symbol time 
T are so-called partial-response schemes. Controlled intersymbol in- 
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terference is introduced. This improves the power spectra’ and the 
detection efficiency under certain conditions.’””” 

Figures 1 and 2 show the motivation for considering partial-response 
continuous-phase modulation schemes with overlapping pulses g(t) of 
increasing length L. Figure 1 shows how the power spectral density 
(normalized to the total power) becomes more narrow for larger L 
values. The sidelobes are also low due to the large number of contin- 
uous derivatives in g(t).'"’”“ Figure 2 compares QPSK, MSK, and 3RC. 
Note that the raised-cosine schemes have constant amplitude. By 
changing g(t), further improvements of the spectral tails can be 
achieved.” 


1.3 Detectors 


We show in our references that the / = 1/2 schemes can be detected 
with good efficiency in an MSK detector’ or an MSK-type detector 
(modified offset quadrature receiver),’® which consists of a “matched” 
filter in each quadrature arm followed by very simple digital processing. 
In its most simple form, this processing consists of differential decoding 
in each quadrature arm’” with differential encoding employed at the 
transmitter. For MSK, the receiver in Fig. 3a is optimum. MSK consists 
of linear modulation in each quadrature arm. A matched receiver filter 
whose impulse response is a half-cycle sinusoid of length 2T is the 
filter function a,(t) and a(t) in this case. 

An MSK receiver with modified filters was proposed in Ref. 2 for 
partial response schemes in the case of TFM. For the partial response 
case with overlapping pulses g(t), the receiver shown in Fig. 3a is 
suboptimum. However, for h = 1/2 and for reasonably small L (L = 4), 
this receiver gives good results for low and intermediate signal-to-noise 
ratios.”° The error probability shown in the graphs below is for binary 
decisions based on the filter output compared to a threshold which is 
0. 

Figures 3b and 3b show two examples of the receiver filter a,(t).” 
The receiver filter a2(t) is identical to a;(t). The first example is the 
so-called sPAM (selected pulse amplitude modulation) filter,’ where 
we have used the same strategy in filter selection as was shown in Ref. 
2. Note that the filter in Fig. 3b is truncated to Nr = 4 symbols. Figure 
3c shows the so-called MIN (minimum energy signal) filter. It is shown 
in some detail in Ref. 13 that the preferable filter depends on the 
s/n . Filter SPAM is good for low s/n’s, while filter MIN is good for high 
s/n’s.'* For 3RC the difference in performance is small, though. 

Figure 4 shows the “detection eye,” i.e., the input cos{#(é, a)} to the 
filter a,(t) for the 3RC scheme. (For further details see Ref. 13). The 
eye in the other quadrature arm is the same, but offset one symbol 
interval T. 
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Fig. 1—Power spectra for binary RC schemes when h = 1/2 and T; = T. 
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Fig. 2—Power spectra for binary 3RC with A = 1/2, QPSK, and MSK. 
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Fig. 3a—Receiver structure for the modified offset quadrature receivers for partial- 
response CPM. 


Fig. 3b>—The spam filter for a binary, A = 1/2, 3RC receiver. 
Fig. 3ce—The min filter for a binary, h = 1/2, 3RC receiver. 
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TIME 


Fig. 4—Cos ¢(¢, a) eye pattern of a binary 3RC, h = 1/2, scheme. Note how open the 
eye is at the detection point (¢ = 57 in this time scale). 


1.4 Error probability in Gaussian noise 


The average error probability P for a given modulation scheme— 
given pulse shape g(t)—and for a given receiver filter a(t) = a,(t) = 
a2(t) can be evaluated exactly. For any given data sequence, the 
decision after the filter is a comparison between a Gaussian variable 
and a threshold which is zero in Fig. 3a. (For details see Refs. 13, 15, 
16, 17, and 18.) It is shown in Ref. 13 that various filters are best at 
various levels of P (various values of s/n). Note that by error proba- 
bility P we consistently refer to the average error probability associated 
with one binary decision after the receiver filter a(t). Because of the 
phase ambiguity of 7, the bit-error probability of the modulation 
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scheme, Pi, is affected by the resolution of this ambiguity.’? With 
differential decoding we have’® 


Poi = 2P(1 — P). (13) 
This will be discussed somewhat in Section V. Until then the calcula- 
tions will be done using P (before the differential decoding). 


In Ref. 13 we see that the average error probability P for the partial- 
response scheme with an MSK-type receiver is 


ae : 2 Ee 
Ee ea aol d; #) (14) 


Here EF, = E, N, is the spectral density of the one-sided Gaussian 
noise, Q(-) is the error function associated with the normal distribu- 
tion, d? is the squared Euclidean distance associated with a signal 
corresponding to data sequence number i received in the fixed filter 
a(t) (see Refs. 13 and 15 for details), and C; is the probability of that 
specific signal. There are at most m = 27*"*! different signals in (14), 
where N7 is the filter length in bit intervals (some of the distance 
values are the same for several data sequences due to symmetry. (See 
Ref. 13 for details.) 

Assuming independent data symbols with p(—1) = p(+1) = 1/2, then 





C; = 1/m. (15) 
For QPSK and MSK we have d? = 2; thus, 
2E> 
P= 
2) ‘ 


with the optimum receiver filter. 

For the general partial-response case, the sum (14) consists of several 
terms with several squared distance values d?, where one of them is 
the minimum squared Euclidean distance. Methods for the calculation 
of d?, i = 1 --- m, are given in Ref. 13. The parameters d? are 
independent of signal-to-noise ratio. They depend only on the data 
sequence, the pulse shape g(¢), and the receiver filter a(¢). 

Figures 5 and 6 show the calculated average error probability P, 
using eq. (14), for a number of modulation schemes where the receiver 
filter is the spam filter.’ The technique described in Ref. 2 was used 
for selecting the filter. Several different pulse shapes are considered 
(see Ref. 13 for details). For comparison, the error probability for QPSK 
is also shown. It is evident that the longer and smoother pulse g(¢), 
the better the power spectrum and the larger the penalty in E,/N. 


anwawnnauad tn ue, +h 
compared to QPSK. POWevey, the penalty—using, for example, 3RC— 


is only 0.5 dB at P = 107°. The penalty for TrM at P = 10° is about 1.0 
dB. For lower P values the penalty is larger. 
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Fig. 5—Error probability in detecting the phase node sequence (output of the receiver 
filter) for SPAM receivers for various binary, h = 1/2, schemes. 


The rest of the paper is organized as follows: In Section II we derive 
formulas for error probability for the partial-response continuous- 
phase modulation schemes with M branch diversity and maximal-ratio 
combining. Section III contains the corresponding results for selection 
combining, and Section IV presents some numerical results. Section V 
contains a discussion and conclusions. 


ll. MAXIMAL-RATIO-COMBINING 


First we consider the problem of calculating the average error 
probability 
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Fig. 6—Error probability in detecting the phase node sequence (output of the receiver 
filter) for SPAM receivers for various binary, = 1/2, schemes. 


= i fiy)P(y)ay, (17) 


where P(y) is the average error probability of the modulation scheme 
(with coherent detection) for Gaussian noise at the s/n y given by (14), 


Pe C:Q(Vd?y), (18) 


and where f(y) is the density function for y. Note that the receiver 
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filter a(t) operates over Nr symbols, typically smaller than 6. Slow 
fading is assumed and the averaging is performed over one “block” of 
length N7, much the same way as is shown in Ref. 20. 

The instantaneous s/n in diversity branch k is assumed to be yz, 
with equal average value for all branches. For Rayleigh fading, the y; 
have the probability density function of eq. (1). 

Assume ideal maximal-ratio combining.*’ This combiner must know 
each path magnitude and phase to perform perfect combining and 
must have the property that the output signal-to-noise ratio is the sum 
of the instantaneous branch s/n’s, i.e., 

M 


y= 2 “ye: (19) 


The random variable y has the density function®’ 


| 
eh wie eB y/l 
fy) =F (z) (m—pi° (20) 


for M-branch maximal-ratio combining, assuming independent Ray- 
leigh fading in each branch. The average receiver-output s/n is 


E{y} = MT. (21) 


References 21 and 22 show that the average bit-error probability for 
BPSK (QPSK) with coherent detection, maximal-ratio combining, inde- 
pendent Rayleigh fading, and Gaussian noise is 


=! r -1 1-3 -—2 
past Vor ta QED) te ed) 


pec itace ae Gs) —(t-1) 
+“ pe th) |} me 


where I is the average s/n per branch and where M is the number of 
branches, M = 2. Corresponding formulas are also described in Refs. 
23 through 25. For M = 1, see Ref. 7. 

The average error probability for a modulation scheme with bit- 
error probability P(y) = Q(Vv2ay) for the additive white Gaussian 
channel is given by eq. (22) with I’ replaced by aI (see Refs. 21 and 
22). 

For the case of coherent Msk-type reception of partial-response 
continuous-phase modulated signals (see Section I), the error proba- 
bility after the detection filter but before differential detection is a 
weighted sum of Q-functions, as shown in eq. (14). Thus, the bit-error 
probability for the fading and diversity case is given by . 
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i+ 14 22 Yigal eis . 
; 1!2 2 219? D) 





1-3-5. ... -(2M — 3) ar —(M-1) 
ee ROR (1+ 2) , (23) 
where C,, d?, i = 1 --- m depend on the modulation scheme and the 


receiver filter used (see Section I). Numerical calculations of (23) for 


various modulation schemes and various M are presented in Section 
IV below. 
For large s/n’s I’, the formula (22) reduces to®””! 


(2M—1)! (1\” 

SP 24 

- M\(M — 1)! (=) (24) 

It is easy to adapt this formula to the partial-response continuous- 
phase modulation case. For large s/n’s eq. (23) can be written as 

Qm—1t /1\""™ ¢ 
=~ —_____ | —__]} .)' ——___ 25 
M\(M — 1)! (=) x 9) 


mi (d?\— 
(5) 


To evaluate the asymptotic s/n difference between different schemes, 
it is convenient to write eq. (25) as 


_ (2M—-1)! 1 
~M\M—-1)! | 1 = 


P 








The special case of BPSK (2PSK) is given by m = 1, C; = 1,d? = 2. 

Thus, the asymptotic E/N, difference between two schemes, e.g., 
between QPSK (BPSK) and some partial-response CPM schemes is given 
by the factor 


eel | (27) 
From the relationships above and (17) we can draw the following 


conclusions: 
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For M = 1, the difference in F,/N,. between a partial-response scheme 
and QPSK is smaller than the difference given by din alone. All the 
d? affect the difference with equal weight C; = 1/m. For increasing M 
values it can easily be concluded from the average in eq. (27) that the 
smallest d?, ie., djin, will play an increasing role and dominate the 
difference for large M, just as for the Gaussian channel. 

These relative performance differences for various M values are 
illustrated by the numerical examples in Section IV. 


Ill. SELECTION COMBINING 

In this section we will derive formulas corresponding to eqs. (22), 
(23), and (24) for the case of diversity with ideal selection combining. 

The ideal selection-diversity combiner is defined here as a device 
that selects that diversity branch with the largest s/n for bit decisions. 
The same branch is used for all symbols over one time interval for the 
receiver filter, under the assumption of slow fading. 

Let: y, be the instantaneous signal-to-noise ratio in branch k with 
average value I’ equal for all branches. For Rayleigh fading, the y, 
have the probability density function of eq. (1). As shown in Refs. 6 
and 7, the probability density function for the ideal selection-combiner 
output s/n y is 


fo) =e = erty, (28) 


where I is the average s/n per branch and M is the number of 
branches. The average receiver-output s/n in this case is 


oem | 
k=1 


which of course increases more slowly with increasing M than the 
corresponding average for maximal ratio combining (21).%” 

First we derive an analytical solution to (17) for selection combining 
and coherent BPSK. It is then easy to apply this solution to the error 
probability for partial-response continuous-phase modulation (14). 
The probability density function (28) can be written as 


ee Ge ;(M-1\ -fon 
fy) = T >» ae (} er, (30) 


The average bit-error probability for BPSK, coherent-detection and M- 
branch diversity with selection combining is 


P= | f(y) Q(W2y)dy 
M2 = . Sy 
HS cy e ‘) | ame dy (a1) 
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We evaluate (31) for the more general family of modulation schemes 
where the bit-error probability for the Gaussian channel is P(y) = 
alt V2ay); here a is a constant <1. 

The integral above can be evaluated analytically as 


| Q(v2ay)e7 dy 








ii Z 1 -12/2 ~“14;) 
= —e''dthe ™ dy 
[ the V20 
aol Tr pe 1 (32) 
21+ 7 cates 
al 
Thus, with (31) we have P: 
M™ (-1)/ = 1 
=— f-" a . e " 1 c* 9 (33a) 
2 ey ay 1+] 
1+ 
al 
which can be written more compactly as 
(-1)/ ) 
1 M J 
P=- }' —_——-.. (33b) 
aie ee 
al 


For the general case with an MSkK-type receiver for a partial-response 
continuous-phase modulation scheme with h = 1/2, as shown in eq. 
(14), we arrive at the analytical error-probability formula 


Fe G ) 


m 1 M 
P= %¥ C,-- ¥ ——_, (34) 
int «= 2 ju i Qj 
dT 
where C;, d?,i=1---m are defined by the modulation scheme and 


by the receiver. Numerical examples for specific transmitted-signal 
formats, receiver filters, and number of diversity branches M are given 
for (34) in Section IV below. 

Next, we evaluate the dominating term in (34) for large signal-to- 
noise ratios I’. For simplicity we do this for BPSK (a = 1). This can 
then easily be extended to the general case (34), as we will see. For 
large I’s, eq. (33) with a = 1 can be written as 
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are: Ores (2k — 1) i)" 
for Scr te gen) 
Since 
M 
y (-1) o) =0 (36) 
i=0 
we have 
ee se Siexas Wed (Qk-1)/1\"™*. _./M\., 
aa a) ao ( ): - 
The quantity 
M 
S= 3 (-1) @) i? (38) 
i=0 


is related to the Stirling numbers of the second kind (see Ref. 26, 
Chapter 24 and Ref. 27, p. 33). S is 0 for k < M and 


S = (-1)".M! (39) 
for k = M.S can also be evaluated for k > M, as shown in Refs. 26 and 
27. Thus, for large I’s, the error probability behaves like 
1-8---.-.- (2M—1) 1 


Pe (40) 
The power of 1/T is the same as for M-branch diversity with maximal- 
ratio combining. The multiplying coefficient is larger, however. 

For high-average-branch s/n’s I, the asymptotic error-probability 
behavior for coherent BPSK with M-branch diversity is shown in Table 


I below. 


Table 1—Asymptotic behavior 
of average probability of error 
for large average per-branch 
signal-to-noise I for selection 
and maximal-ratio combining 
using BPSK (QPSK). 


Maximal- 
Selection Ratio Com- 
M Combining bining 
1 1 a 
4. 4r 
9 31 3 1 
8’ 161? 
3 15 1 5 1 
161? nee Oe 
4 105 1 35 1 
32 14 256 4 
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For partial response 4 = 1/2 continuous phase-modulation with an 
MSK-type receiver, the asymptotic behavior for selection combining is 


Ved yoo sate 2M —1 1 
polide OM) fd 


9 (M+1) 
. C; = (41) 





Comparing formula (41) with the corresponding one for maximal-ratio 
combining (24), it can immediately be seen that the relative asymptotic 
performances of various modulation schemes are the same for the 
selection combiner as for the maximal-ratio combiner. This is also 
illustrated by the curves in Section IV. 


IV. NUMERICAL RESULTS 


Formula (23) for maximal-ratio combining and (34) for selection 
combining are used to calculate the error probability P in the figures 
below. Again note that the error probability is that found after the 
filter in the quadrature arm in Fig. 1. 

Figure 7 shows the error probability P versus the average per-branch 
(and per-bit) signal-to-noise ratio I‘ for the modulation scheme 3RC, 
h = 44, with a receiver filter of MIN-type.’® Formula (23) was used and 
QPSK results are shown for comparison. Note that the difference 
between QPSK and 3RC grows with M. Also compare the difference in 
Figs. 5 and 6 for the nonfading-additive white Gaussian channel. As 
was predicted by the formula in Section II above, the asymptotic 
difference between QPSK and 3RC is smaller for M = 1 than for M = 
16 and for the additive white Gaussian channel. This is due to the fact 
that, for low M, low signal-to-noise ratios dominate the density func- 
tion f(y), as shown in eq. (1). For this region of y, the difference 
between the error probability for QPSK and 3RC is small for the 
additive white Gaussian channel (see Figs. 5 and 6). 

Figure 8 shows the same modulation and diversity schemes as Fig. 
7 with the exception that for 3RC a spam-receiver filter is used (see 
the introduction and Ref. 13). Notice that the differences between the 
corresponding curves in Figs. 7 and 8 are small. The MIn filter is better 
than the spam filter for high s/n’s over the additive white Gaussian 
channel (see Figs. 5 and 6 and Ref. 13). The spam filter is better than 
the MIN filter for low s/n’s. This explains the small but noticeable 
differences between the 3RC curves in Figs. 7 and 8. 

Figures 9 and 10 show the 3RC-Min and 3RC-sPAM cases for selection 


combining with QPskK results shown for comparison. Formula (34) was 
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Fig. 7—Error probability P vs average-per-branch s/n I with 3RC, h = 1/2, receiver 
with MIn-filter. (The curves are shown without differential encoding/decoding.) 


used for the computations. Note that the same relative behavior is 
present in Figs. 9 and 10 as was shown in Figs. 7 and 8. Also note the 
larger per-branch s/n required for selection combining compared to 
maximal-ratio combining for equal M.®’ 

The exact analytical formulas (23) and (34) are easy and straight- 
forward to evaluate. For example, for M = 16 and for the 3RC-sPAM 
case with a filter of length 4 binary symbols, 16x2° terms are used in 
the sums. In general, Mx2%7**! terms are summed where N7 is the 
filter impulse response length in bits.” 

Figures 11 to 14 show the error probability P versus the average- 
per-branch s/n I for an increasing number of diversity branches with 
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Fig. 8—Error probability P vs average-per-branch s/n I’ with 3RC and spam filter M- 
branch diversity with maximal-ratio combining. 


maximal-ratio combining. Again, QPSK (BPSK) results are shown for 
comparison. The same group of modulation schemes and receiver 
filters are used. Note the widening spread for increasing M, as expected. 
The corresponding case for selection combining is shown in Figs. 15 
through 17 (and in Fig. 11 for the M = 1 case). 

Figure 18 shows another way of presenting the relative error- 
probability results. This figure shows the required receiver output s/n 
(M.-T) to achieve bit-error probability 10~ as a function of the number 
of branches of diversity M/. Maximal-ratio combining is assumed. The 


nhavant ODM ODM avnlA ADDN -3ib 
modulation schemes are coherent 4itU, ost, 4G 4100 With SPAM 


filters,’ with @pskK shown for reference. For increasing M, the s/n for 
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Fig. 9—3RC with min filter and M-branch diversity with selection combining. 


the additive Gaussian channel is approached. Note the relative position 
of the curves for 2RC, 3RC, and 4RC. For further numerical results, 
see Ref. 28. 


V. DISCUSSION AND CONCLUSIONS 


The curves in Figs. 7 through 18 are calculated with the assumption 
that the receiver can resolve phase ambiguity of 180 degrees in the 
detection process.'” This is normally done by employing differential 
encoding and decoding’®”” and, likewise, for QPSK. For this case, the 
bit-error probability for the Gaussian channel is” 


Poi = 2P(1 — P), (42) 
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Fig. 10—3RC with spam-filter and M-branch diversity with selection combining. 
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Fig. 11—Error probability P vs s/n T for various selected h = 1/2, binary CPM schemes 
with selected receiver filters. On each curve the modulation format and the receiver 
filter are given. 


where P is the error probability in the symbol decision after the linear 
filter in each quadrature arm. Thus, for the fading and diversity case, 
the same relative curves are obtained as upper bounds 


Po Ss 2P - (48) 


on the bit-error probability with differential encoding/decoding. In 
absolute numbers, all error probabilities P must be multiplied by 2. In 
principle, averaging of the type in (17) can be performed numerically 
for (42) but, especially for large s/n’s and large numbers of diversity 
branches, the bound based on eq. (43) is very tight. 

The calculations of the bit-error probability for TFM with fading are 
given in Ref. 28. This lies, as expected, between that of 3RC and 4RC. 

The performance of GMSK in fading for various values of the param- 
eter B,T—affecting the length of the pulse g(t)—can be estimated by 
comparisons to the raised-cosine pulses.** Compare the spectra and 
eye patterns in Refs. 3 and 4 with those for raised-cosine pulses in 
Refs. 11 through 13. Approximate fading and diversity (M = 2) error- 
probability behavior is given in Ref. 3 for some cases of GMSK. A 
detailed comparison of raised-cosine schemes and GMSK is given in 
Ref. 28. 
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Fig. 12—Error probability P vs average-per-branch s/n F for the group of modulation 
schemes in Fig. 11. M = 2 branch diversity with maximal-ratio combining is used. 
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Fig. 13—Error probability P vs average-per-branch s/n I for M = 4 branch diversity 
with maximal-ratio combining. 
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Fig. 14—Error probability P vs average-per-branch s/n I’ for M = 8 branch diversity 
with maximal-ratio combining. 
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Fig. 15—Error probability P vs average-per-branch s/n © for M = 2 branch diversity 
with selection combining. 
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Fig. 16—Error probability P vs average-per-branch s/n I’ for M = 4 branch diversity 
with selection combining. 
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Fig. 17—Enrror probability P vs average-per-branch s/n T° for M = 8 branch diversity 
with selection combining. 
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Fig. 18—Required receiver output s/n to achieve 10~? error probability as a function 
of the number of branches for coherent detection of QPSK and for 2RC, 3RC, and 4RC 
with sPAM filters. 


This paper contains formulas for the calculations of the bit-error 
probability for a class of modulation schemes with attractive spectral 
behavior. The class contains such modulation schemes as MSK, FFSK, 
TFM, and GMSK, as defined above. A simple suboptimum coherent 
detector can be used with performance approaching the optimum 
detector. The channel is assumed to be a slow Rayleigh fading channel 
with Gaussian noise, and diversity is employed to combat multipath 
fading. We assume the receiver uses either ideal maximal-ratio com- 
bining or selection combining. Analytical formulas are derived for both 
cases, and simple asymptotic expressions for large signal-to-noise ratios 
are also derived and discussed. It is noted from the numerical calcu- 
lations and also from the asymptotic formulas that the difference in 
E.,/N. between various modulation schemes in the considered class at 
a given error probability decreases as the number of branches of 
diversity decreases. 
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Comparisons on Blocking Probabilities for 
Regular Series Parallel Channel Graphs 


By D. Z. Du and F. K. HWANG 


(Manuscript received January 19, 1982) 


We give a sufficient condition for one regular series parallel 
channel graph to be superior to another with the same number of 
stages. The main mathematical tools used for doing this are the 
recently developed results on majorization over a partial order. 


|. INTRODUCTION 


An s-stage channel graph is a graph whose vertices can be partitioned 
into s subsets (stages) Vi, V2, ---, V;, with V; and V, each containing 
a single vertex (called the source and the sink, respectively), and whose 
edges can be partitioned into s — 1 subsets Ei, Eo, ---, Es-1 such that 

(4) Edges in EF; connect vertices in V; to vertices in Vi+1, 

(tt) Each vertex in V;, 1 <i < s, is connected to at least one vertex 
in each of V;-1 and V;+1. 

A channel graph is regular if for each 7, the numbers of edges in Ej-1 
and £; coincident to a vertex in V; are independent of which vertex is 
chosen. 

A series combination of an s-stage channel graph G and a f¢-stage 
channel graph H is a union of G and H into an (s + ¢ — 1)-stage 
channel graph, with the sink of G identified with the source of H. A 
parallel combination of two s-stage channel graphs is a union of these 
two graphs into another s-stage channel graph with the source and the 
sink of one graph being identified with the source and the sink, 
respectively, of the other graph. A channel graph is series parallel if it 
is either an edge or is constructable from two smaller series parallel 
channel graphs by either a series or a parallel combination. A series 
parallel canopy is a special case of a series parallel channel graph in 
which parallel combinations are allowed only when at least one of the 
two component subgraphs consists solely of a single edge. 

Each edge in a channel graph can be in one of two states, occupied 
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or idle. In this paper, we follow Lee’s assumption’ that the states of 
the edges are independent and that each edge in E; has probability pi, 
called the occupancy for £;, of being occupied. The blocking probabil- 
ity of a channel graph is the probability that every channel—by which 
we mean a path from source to sink consisting of one edge from each 
E;—contains at least one occupied edge. An s-stage channel graph is 
said to be superior to another s-stage channel graph if the blocking 
probability of the former never exceeds that of the latter, independent 
of the occupancies for the E; (common to both graphs). 

Chung and Hwang’ showed that a regular series parallel channel 
graph (hereafter referred to as rspcg) without multiple edges can be 
uniquely represented by its degree vector. They also proved that in 
the case of two s-stage regular series parallel canopies, a necessary and 
sufficient condition for one graph to be superior to the other is that 
the degree vector of the former “majorizes” that of the latter. They 
conjectured that the same condition might also hold for rspcg’s. 
However, counterexamples to the sufficiency of the condition for 
rspcg’s were given in Refs. 3 and 4. In this paper, we give a sufficient 
condition for one s-stage rspcg to be superior to another, with multiple 
edges between two vertices allowed, by using the recently developed 
results of majorization over a partial order.”® 


Il. MAJORIZATION OVER A PARTIAL ORDER 


A set of numbers A = {a; = a2 = --- = ay} is said to be weakly 
submajorized’ by another set of numbers B = {b; = b2 => --+- = by} if 


k k 
yas > OG; foreach k=1,--+-,n. 
=1 = 


If, in addition, 


then A is simply said to be majorized by B. 

The above concept of set majorization has been extended to major- 
ization over a partial order.*® Let P = {S, >} denote a partial order on 
S, where S is a set of n elements and s;, s; € S, s; > s; indicates that s; 
is greater than s; in P. Let A = {ai, dz, +--+, an} and B= {hi, bo, ---, 
bn}, where A and B can be thought of as two sets of weights for the 
elements in S. Then A is said to be weakly submajorized by B on P if 
for every filter S; of S, 


> a= > bi, 


s,ES; s,ES; 
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where S; is a filter if s; € S; and s,—> s; => s, € S;. If equality holds for 
S; = S, then A is simply said to be majorized by B on P. 
Lemma 1. Suppose A is weakly submajorized by B on P. Then there 
exists C = {¢1, C2, +++, Cn}, where c; = 0 for all i, such that A + C is 
majorized by B on P. 
Proof: The proof is by induction on n. For n = 1, Lemma 1 is true by 
setting c, = b, — a. For general n, without loss of generality, assume 
that s, is a minimal element in P. Set cn = bn — an. Then A’ = {qu, az, 
©**, An-1, Gn + Cn} is still weakly submajorized by B on P, since for 
any filter S; containing sp, 
by b; - »> a= oS 5; - > a; = 0. 
s,ES; s;ESj sje Sj— {sn} s;€Sj— {Sn} 
Next consider the partial order P on S — {s,}. By our inductive 
assumption, there exists nonnegative C1, C2, +++, Cn—-1 such that (a; + 
C1, Az + Co, +++, An-1 + Cn-1) IS Majorized by (bi, be, «++, bn—-1). Lemma 
1 follows immediately. 
We quote a result from Ref. 5: 
Theorem 1: Let f(x1, x2, +++, Xn) be a function defined over the 
domain D. Let P = (X, —) denote a partial order, where X = {x, x2, 
+++, Xn}. Then 


f(a1, a2, -9*5 Qn) = f(hi, be, »++, bn) 


for all A majorized by B on P if and only if f is such that for every i 
and J, 


tees gees O8 OL overall XE D. 
OX: OX; 
We now generalize Theorem 1 into Theorem 2. 
Theorem 2: Let f(x1, x2, +++, Xn) be a function defined over the 
domain D such that f is monotone nonincreasing in each of its 
arguments. Let P = (X, —) denote a partial order, where X = {x1, x2, 
-++, Xn}. Then 


f(a, ae, +++, Qn) = f(b, be, -++,b,) 


for all A weakly submajorized by B on P if and only if f is such that 
for every i and J, 


Ce) ts) 
Pee ed ee overall XE D. 
OXi OX; 
Proof: 


(t) Assume f(a, G2, -++, Qn) = f(bi, be, -+-, b,) for all A weakly 
submajorized by B on P. Then, in particular, 
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f(ai, a2, 22+, Qn) = f(dr, be, -++, bn), 
or equivalently, 
—f(ai, G2, °°, Qn) = —f(bi, bo, eee y bn) 


for all A majorized by B. From Theorem 1, a necessary condition for 
this to happen is that for every i and J, 





png te 
OXi Ox; 
or equivalently 
te) 0 
of overall X ED. 
OX; OX; 


(ii) Assume that for every i and 7 


af _ af 


pee 
OX; Ox; 
Let A be weakly submajorized by B on P and let Si 6; — Ma ai = 
c = 0. From Lemma 1, there exists nonnegative C such that A + C is 
majorized by B. From Theorem 1, 


overall XE D. 


xir Xj => 





—f(ai + C1, Az + C2, +++, Qn + Cn) S —f(i, be, ++, bn). 


Since fis monotone nonincreasing in each x;, it follows that f(ai, ae, 
°°, An) > f(ai + ci, A2 + Ce, 226, An + Cn) > f(bi, bo, coe, bn). | 


lll. THE MAIN RESULTS 


Theorem 2 will be used for comparing two rspcg’s. To do this, 
however, we first have to define a partial order such that an rspcg can 
be represented as a set of weights for the elements of the partial order. 
This can be done by using the Takagi graph characterization of an 
rspcg. 

An (i, j, r) multiplex, 1 = i<j <= s, of an s-stage channel graph G is 
an s-stage channel graph formed from the union of r copies of G, with 
the copies being merged into a single copy from stage 1 to stage i and 
from stage j to stage s. A channel graph is called a Takagi graph*” if it 
can be obtained as a multiplex of a smaller Takagi graph, where the 
smallest Takagi graph of s stages is an s-stage path. An (1, j, r) 
multiplex can also be represented by the equation m;; = r, where mj; 
is called the multiplex index and r is the value of the index. Therefore, 
a Takagi graph can be represented by a set {m;; = k} called a multiplex 
set. Figure 1 illustrates how the Takagi graph {ms = 3, 
Mz, = 2} is constructed. it is clear that adding or deleting a mulitipiex 
index with value one has no effect on the Takagi graph. Up to this 


1968 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1982 


O-—O0—O—0O TEE ee 
m3 =3 M4 = 2 


Fig. 1—A Takagi graph. 


equivalence, it has been proved (see Ref. 4) that there exists a one-to- 
one mapping between multiplex sets and Takagi graphs, regardless of 
the ordering of the multiplex indices in the set. Furthermore, it is 
straightforward to verify that for an rspcg the product of all the values 
in its multiplex set equals the total number of distinct channels. 

Let mi; and m,, denote two multiplex indices. Then mj; is said to 
CrOSS Mpq if 1 << pp <j <q, and to contain mp, if i = p<qxjJ. The 
following has been proved in Ref. 10: 

Theorem 3: A channel graph is an rspcg if and only if it is a Takagi 
graph without crossing multiplex indices. 

We define a partial order P,, on the set of multiplex indices {m,;: 
1<i<j<s} by: m;— m,, if mi; contains mp,. Then the multiplex set 
of any s-stage rspcg can be considered as a set of weights for the 
elements of P,, (if mi is not in the multiplex set, we define mj; = 1). 
For a given multiplex set M, we define Mi; to be the subset of M 
consisting of all multiplex indices contained by m,;. We also let P,; 
denote the partial order P restricted on M;;. For fixed occupancies p,, 
P2, +++, Ps-1, let B(M) denote the blocking probability for the Takagi 
graph with multiplex set M. Then, from Theorem 3, we have 


B(Mi) = {} = =p) Lp BMI} 
leLiy MygENij 


where Li; = {l:mzi41 = 1, mij > Mii41, but there does not exist mu, > 
1 such that my — muy — mii4i} and where Nij = {Mpq:Mpq > 1, 
Mij > Mpq, but there does not exist mi, > 1 such that mij > mw > 
Mpq}. We quote a result from Ref. 2: 


Lemma 2: For given constants ¢), C2, +++, Cn, all lying between zero 
and one, define 


f(Xn) = (1 = en); 
f (xe, Xk+1) °° %5 Xn) = {1 a cr 1 aa f(Xn+1; Xk+2) °°*, Xn) ]}"* 
for kR=1,2,---,n-—-1. 


Suppose the vector (In ay, In az, «++, In an) is weakly submayorized by 
the vector (In bi, In bo, «++, In b,) where In a; and In b; are nonnegative 
for alli. Then 


f(ai, G2, +++, Qn) = f(r, be, +++, by). | 
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In particular, for any w > 1 and i <j, we have f(a, a2, «++, ai, ++, 


GjW, +++, An) = f (ai, Az, +++, AW, +++, aj, +++, Qn). Therefore, we also 
have: 
Corollary: 
Of (X1, Xo, +++, Xn Of (X1, X2, +**, Xn . ; 
f(x1, x2 ) — Of (1, x2 ) op FS 
d In x; dln x; 


We are now ready to prove Theorem 4. 
Theorem 4: An s-stage rspcg with the multiplex set {mij = aij} is 
superior to another s-stage rspcg with the multiplex set {mi; = bi} if 
{In bi;} is weakly submajorized by {In aj} on Ps. 
Proof: A straightforward induction proof shows that B(M,) is mon- 
otone nonincreasing in each m; € M,,. Therefore, if we can prove that 
for every Muy > Mxy, 

0B (Mi) we dB(Mis) 

dln mw 98ln my’ 
then Theorem 4 will follow immediately from Theorem 2. 

Consider a path Z from the top of Pi; to the bottom of P,; which 
contains m,, and my. Let ri, 1 = 1, 2, ---, n, denote the value of the 
ith multiplex index on this path. Suppose we hold every other mj; 
constant except those on Z. Then B(M;,) can be expressed as a function 
of r1, ’2, +++, fn alone since all other mj; are now constants. To be more 
specific, we have 


B(rn) = (1 — en) 
and 
B(rz, Tesi, +++, Tn) = (1 — ex — B(rasi, reve, +++, Pn) J} 
for kR=1,2,---,n—1. 
From the Corollary of Lemma 2, we conclude that i <7 implies 


OB(ri, T2, +++) Tn) _ OB(ri, Pe, +++ 5 Tn) 
dln 7; > aln7; 


In particular, we have 
dB(ri, re, ret) Tn) _ OB(ri, 12, 72°, Tn) 
dln Muv > dln Mxy 


The proof is now complete by noting 
OB(Mis) _ OB(ri, r2, +++, Tn) 


te) In Muv 0 In Muy 
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and 
0B(M,,) = dBi(ni, Ta, *°*%, Tn) 


d In mxy dln my 


Define M1, = {mi; © Mi,: Li; # ¢} and define the partial order P,, 
accordingly. 

Theorem 5: An s-stage rspcg with the multiplex set {mij = ai} is 
superior to another s-stage rspcg: with the multiplex set {mi = b;;} 
only if {In bj} is weakly submajorized by {In aij} on P, (associated 
with the {aj} set). 

Proof: Consider two s-stage rspcg’s A and B with multiplex numbers 
{mi = aij} and {mi = bi}, respectively. Suppose there exists a filter 
M Cc M,, such that 

> In ay < yy In bj. 


m;EM m,EM 


Consider a set of occupancies p, D2, «++, Ds—1 Such that p, = 0 if mzpei 
is contained by any m;; not in M. Clearly, if all edges from stage 7 to 
stage 7 are idle, then we can set aj; and 0,; to 1 without affecting the 
blocking probabilities of A and B. But now, owing to the assumption 


XY Inajy< ¥Y Indy, 


myEM m;EM 


the product of all aj; in A is less than the product of all b;; in B, or 
equivalently, there are fewer paths in A than in B. But it is well known 
that when the occupancies of all edges approach one,’ then the blocking 


<s =e 


(b) 


M13 


M12 M93 


(c) 
i Pa 
(d) (€) 


Fig. 2—Graphs for Example 1. 
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Fig. 3—Graphs for Example 2. 


probability of a channel graph with fewer channels exceeds the block- 
ing probability of a channel graph with more channels. Therefore, A 
cannot be superior to B. Hf 


IV. EXAMPLES 


Example 1. Figure 2(a) shows the Takagi graph A = {712 = m3 = 2, 
m3 = 4}. Figure 2(b) shows the Takagi graph B = {m2 = 3, mo3 = 
5}. Figure 2(c) shows the partial order P13. Figure 2(d) shows the 
weights of A on Pi3. Figure 2(e) shows the weights of B on P\3. 

It is easily seen that Mi; has only four filters, {7113}, {712, mis}, 

{m23, M3} and {mi2, Mo3, m13}, and product of the weights of A is 
greater than that of B in every case. From Theorem 4, the first graph 
is superior to the second graph. 
Example 2. Figure 3(a) shows the Takagi graph A = {m5 = 4, mi3 = 
m3, = 2}. Figure 3(b) shows the Takagi graph B = {my = 3, m5 = 
5}. Figure 3(c) shows the partial order P,;. Figure 3(d) shows the 
weights of A on P,5. Figure 3(e) shows the weights of B on Ps. 
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Consider the filter M = {mu4, m45, M35, M15, M25}. The product of 
the weights of A on M is 8 while the product of the weights of B on M 
is 15. Hence, A is not superior to B. Note that A can still be preferable 
to B (or B preferable to A) in many other senses. But one does not 
dominate the other as far as the strong property of superiority is 
concerned. 


V. CONCLUSION 


Channel graphs, of which regular series parallel channel graphs form 
an important subclass, have been extensively used in modeling and 
analyzing blocking probabilities of switching networks. A popular 
concept in comparing the blocking characteristics of two channel 
graphs is to see whether one is superior to the other under arbitrary 
traffic loads. We give a sufficient condition for superiority in comparing 
regular series parallel channel graphs. 
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On the Physical Limits of Digital Optical 
Switching and Logic Elements 


By P. W. SMITH 
(Manuscript received February 5, 1982) 


In this paper we identify and discuss some fundamental physical 
mechanisms that will provide limits on the speed, power dissipation, 
and size of optical switching elements. Illustrative examples are 
drawn primarily from the field of bistable optical devices. We com- 
pare the limits for optical switching elements with those for other 
switching technologies, and present a discussion of some potential 
applications of optical switching devices. Although thermal effects 
will preclude their wide application in general-purpose computers, 
the potential speed and bandwidth capability of optical devices, and 
their capability for parallel processing of information, should lead to 
a number of significant applications for specific operations in com- 
munication and computing fields. 


|. INTRODUCTION 


A number of recent developments have increased the interest in 
digital optical signal!-processing devices and techniques. Laser tech- 
nology has now advanced to the point that lasers are being used in 
consumer electronics. Optical fiber communication systems are being 
widely installed. Integrated-optics spectrum analyzers have been de- 
veloped. 

In the research stage it has been shown that optical fibers can be 
used to transmit information at rates approaching 1 'THz.’” This rate 
is much beyond the capabilities of any presently- known electronic 
light detector. Thus, to utilize this information-handling capacity, 
some form of optical signal processing will have to be performed before 
the light signals are converted to electronic ones. 

Low-power integrated-optics light switches® and low-energy inte- 
grated-optical bistable devices* capable of performing optical logic 
have been demonstrated. It is tempting to propose that such digital 
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optical switching elements be used to construct high-speed computers 
as well as repeaters and terminal equipment for optical communica- 
tions systems. To examine these possibilities we need to understand: 
(t) What are realistic possibilities for speed, power dissipation, and size 
for optical switching elements? and (iz) What are the fundamental 
limits imposed by the physics of the nonlinear interactions, and by the 
available optical materials? 

Previous studies of these optical device limits have been made by 
several authors. The pioneering work of Keyes’ examined several 
nonlinear optical processes and concluded that thermal considerations 
imposed severe limits on the use of optical logic elements. A similar 
assessment was made by Landauer.® A more optimistic conclusion was 
reached by Fork,’ who suggested that by using suitable interactions 
and resonant structures, competitive optical elements could be real- 
ized. Recently Kogelnik® has examined the role of integrated-optics 
devices and has concluded that they may well be most useful in 
performing functions that cannot be provided by other technologies. 

In this paper we will attempt to provide a perspective on the ultimate 
limits of optical switching elements, and the areas in which one might 
expect optical signal processing to offer a significant advantage over 
other technologies. 


ll. BISTABLE OPTICAL DEVICES 


In this paper we will draw examples from the field of bistable optical 
devices. This is to some extent due to a bias of the author, as he has 
' worked on these devices for several years. However, in many respects, 
bistable optical devices are the most basic binary optical systems, and 
they have a demonstrated capability for low-energy latching operation. 
In addition, these devices are extremely versatile and can function as 
optical limiters, differential amplifiers, and optical logic elements. 
Their transmission can also be controlled by another optical beam 
creating an “optical triode.” 

A generic bistable optical device is shown in Fig. 1. It consists of a 
Fabry-Perot resonator containing a nonlinear optical material. This 
nonlinearity can be either a saturable absorption (an absorption that 
decreases with increasing light intensity), or a nonlinear refractive 
index (a refractive index that increases or decreases with increasing 
light intensity). The bistability arises from the simultaneous require- 
ments that the intensity of light inside the resonator, and thus the 
transmitted light intensity, depends on the resonator tuning (or the 
loss in the resonator), and the resonator tuning (or the loss in 
the resonator) depends on the intensity of light in the resonator. 
The resonator transmission exhibits optical hysteresis, as shown in 
Fig. 1b. 
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Fig. 1—(a) A generic bistable optical device. (b) Typical output power characteristic 
for a bistable optical device. 


Optical bistability has been demonstrated using both intrinsic de- 
vices, which use materials with an intrinsic optical nonlinearity, and 
hybrid devices, which use a detector and an electrooptic modulator to 
create an artificial nonlinear medium. 

Many types of bistable optical devices have been developed and 
studied. Small, integrated-optics hybrid devices have been operated 
with less than a picojoule of optical energy.” Two-dimensional arrays 
of bistable elements using a liquid-crystal light valve have been dem- 
onstrated for image-processing applications."° Nonresonant devices 
have been developed that use no Fabry-Perot resonator and thus have 
a broad frequency response and can switch very rapidly with a suitably 
fast-responding nonlinear material.’ 


ill. SPEED AND POWER LIMITATIONS 


In this section we will discuss the fundamental physical mechanisms 
that limit the performance of these devices. Although we will specifi- 
cally consider bistable optical devices, the results will be generally 
applicable to any passive digital optical switching elements. 

The switching speed of a bistable optical device is limited by the 
buildup time of the resonator, and by the response time of the nonlin- 
ear medium. In principle, the resonator response time can be made 
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negligible by reducing the length of the resonator, or by using a 
nonresonant configuration. The ultimate limit is set by the response 
time of the nonlinearity. Materials exhibiting strong electronic nonlin- 
earities are known with response times of <10~* seconds. To operate 
a device with such a short response time, however, requires high light 
powers. 

The switching power and switching speed of a bistable optical device 
are not independent. For example, if the response time of a given 
device is dominated by the resonator buildup time, the response time 
can be reduced by a factor of two by halving the length of the 
resonator. However, as only half the length of nonlinear material is 
now available, twice the switching power must be used to reach the 
switching threshold. 

R. W. Keyes’ has discussed several physical processes that limit the 
switching power and speed of optical devices. For any (nonreversible) 
switching operation, it can be shown that a minimum energy of the 
order of kT must be dissipated (2 is Boltzman’s constant and T is the 
absolute temperature). Quantum mechanical considerations lead to 
the assertion that a switching operation must dissipate at least h/r of 
energy (A is Planck’s constant and 7 is the switching time). These 
limits are shown in Fig. 2, which is a plot of the power required for a 
switching operation as a function of switching time, i.e., the time for 
which this power must be applied. The frequency label on the horizon- 
tal axis is appropriate if switching is being done repetitively so that a 
switching time limit implies a limit on the data rate. 

Keyes has discussed the limitations imposed by the heat dissipated 
in a switching element. For continuous operation, this heat sets an 
upper limit on the achievable switching rate (a higher rate would result 
in an unacceptable temperature rise in the device). The region affected 
by such thermal considerations is also shown in Fig. 2, assuming a 
value of heat transfer coefficient (100 W/cm’) that is appropriate for 
liquid-cooled elements and a maximum acceptable temperature rise of 
20°C. In many cases a lower temperature rise may be required because 
of the rapid refractive index change with temperature exhibited by 
most optical materials. It is important to note that a switching device 
can operate in this “thermal transfer” region provided that it is 
operated at less than the maximum repetition rate, or that not all of 
the switching power is dissipated in the device. 

Keyes considered the case of an optical switching device that oper- 
ates by absorbing light that saturates an atomic transition and changes 
the optical properties of the material. To achieve appreciable satura- 
tion, the condition 


ol > hc/Atp (1) 
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Fig. 2—Limitations on optical switching devices. The frequency scale (at bottom) 
applies for repetitive switching. The heavy lines indicate limits on optical switching 
devices for the case of A = 0.5 ym (solid lines) and A = 5 pm (dashed lines). 


must be satisfied. Here, o is the peak absorption cross section of the 
transition, IJ is the light intensity, A is Planck’s constant, c is the 
velocity of light, \ is the wavelength of the light, and rp is the decay 
time of the transition. From the expression for the Einstein stimulated 
emission coefficient, we can write 


- An’ |n/?T. 


3e0hA 2) 


where || is the dipole moment of the transition, T is the inverse line 


width of the transition, and € is the vacuum permittivity. 
The maximum intensity for a given input power is obtained in a 
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waveguide geometry for which the input power, P, is given by 


P~IW (3) 
(see Section IV). Combining eqs. 1, 2 and 3 we obtain: 
3e0h7A7c 
P>——.—_-.. . 4 
An”|p|’Totp ( ) 


To evaluate eq. 4 we will assume a large dipole moment for an atom 
given by 


|u| = edo, (5) 


where e is the electronic charge and dp is the Bohr radius. We will 
further take the response time, 7, of the switching element to be 


T=TD™ T2. (6) 


(T2 cannot be less than 7p, and a larger T2 implies a larger switching 
energy.) With these assumptions we find 
2 


P>12x10"%x ua W, (7) 
T 


where A is in ym and 7 is in seconds. This limit is plotted on Fig. 2 for 
\ = 0.5 pm and A = 5 pm and labeled “Absorptive Nonlinearity.” Two 
points should be noted. First, Keyes shows that a similar limit should 
also apply for the case of a second-order nonlinear effect [x process] 
or for the case of a nonlinearity based on self-induced transparency. 
Second, it has recently been shown that it is possible to find systems 
in which one can use excitonic resonances” to obtain effective dipole 
moments appreciably greater than ea. An example using such a 
system is described in Section VI. 

A different limit is found for the case of a third-order nonlinearity 
[x process]. Let us consider a material exhibiting an optical Kerr 
effect, i.e., the refractive index has a term proportional to the light 
intensity. To obtain an appreciable effect, we require a change in phase 
shift through the medium 


Ad > 7. (8) 


If the refractive index change is m2, where nz is the optical Kerr 
coefficient, then eq. 8 becomes 
2anol? Ss 
ea 
where /is the length of the element. The delay time is governed by 
the length 





(9) 


T = not/C, (10) 
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where Mo is the (linear) refractive index of the material. Combining 
eqs. 9, 10, and 3, we obtain 


A3 
No (11) 





2necr- 


Note that the dependence of P on 7 is different from that in eq. 7. 

How can we estimate n2? We can argue that the nonlinear refractive- 
index term should be of the order of unity for light fields of the order 
of the atomic fields. Thus, we could write 





Nol = No (12) 
for light fields of the order of e/aé. Now 
2 

I= come (13) 


where E is the electric field strength of the light. We combine eqs. 12 
and 13 to obtain 
4 
cl (14) 
ce 

From eq. 14 we obtain ne = 2.9 x 10°’[W/cm?]"'. Much larger 
electronic nonlinearities are known, however. The polydiacetylene 
PTS has the largest-known value;” it is nz ~ 6 X 10°°[W/cm?]*. The 
reason for this large value is that the electrons are relatively unconfined 
along the chain axis of the PTS molecules. Thus, effective distances 
much larger than ap are encountered, and as nz « a*, nonlinearities 
much larger than our estimate are found. The limits represented by 
eq. 11 are shown in Fig. 2 for A = 0.5 wm and A = 5 pm and labeled 
“Reactive Nonlinearity.” They are evaluated using the value of ne for 
PTS. 

A third limit shown in Fig. 2 is derived from statistical considera- 
tions. A number of photons large compared with unity is necessary to 
define a switching state. We have somewhat erin. taken 10° 
photons as this statistical limit, i.e., 


Pr = 10°he/d. (15) 


This limit is also plotted in Fig. 2 for \ = 0.5 um and A = 5 pm. 

A word of caution is in order here. These limits that we have 
identified are “fuzzy” in that it may not be possible to do as well as 
these limits, or it may be possible to do somewhat better than these 
limits would indicate. In general, many kT of energy will be required 
for stable switching devices. On the other hand, the use of a high- 
finesse optical resonator will lower the required switching energy. 
These limits are intended to be used as a guide and an indication of 
the underlying physical mechanisms. 
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An optical resonator decreases the required switching power at the 
expense of a reduction in the bandwidth. A bistable optical device 
utilizing a lossless material with a refractive nonlinearity has a switch- 
ing power that varies as 


P«1/F’, (16) 


where F is the finesse of the resonator. In the region where the 
switching time is limited by the resonator, 


Toff (17) 


so that the switching energy, defined as the product of the switching 
power and switching time, is given by 


Pr«i/F. (18) 


If the switching time is limited by the response time of nonlinear 
material, 7 will not depend on F, and 


Pr«1/F’. (19) 


The limits shown in Fig. 2 were computed assuming no resonator, i.e., 
for F ~ 1. We see that with high-finesse resonators, switching energies 
appreciably below the limits shown in Fig. 2 should be possible. The 
10° photon limit will still apply, however. 


IV. SIZE LIMITATIONS 


To obtain the largest light intensity for a given input power, it is 
usually desirable to focus the input light. The light can be focussed to 
a cross-sectional area of ~A’, but will diffract rapidly if not confined by 
some waveguiding structure. For this reason the lowest-power switch- 
ing devices are likely to be those in which the light is guided in an 
optical dielectric waveguide with cross-sectional dimensions of ~A. 
(Kogelnik® has pointed out that with a smaller dielectric waveguide, 
the guided mode will extend beyond the waveguide walls so that the 
minimum light-beam cross section will always be of the order of X. 
Light can be confined to smaller cross sections using metallic wave- 
guides, but in this case large absorption losses will result.) 

The minimum length of the waveguide (i.e., the device) will depend 
on the strength of the optical nonlinearity and on the finesse of the 
optical resonator. In many cases the linear absorption loss of the 
nonlinear medium will determine the resonator dimensions. Miller™ 
has shown that for a high-finesse waveguide resonator containing a 
material with a nonlinear refractive index and a linear absorption loss, 
the lowest switching power will occur for a length of resonator such 
that 


1-R=A, (20) 
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where R is the reflectivity of each of the resonator mirrors, and A is 
the absorption loss per pass through the nonlinear medium. If we write 
A = aé, where /is the length of the medium and a is the absorption 
coefficient, this condition requires 


atl= 7/2F (21) 
or 
¢= 1/2aF. (22) 


Thus, the length of a device that is optimized for minimum switching 
power will be determined by the absorption coefficient and the finesse 
of the resonator. If the nonlinearity is caused by the absorption of 
light, then compact, fast, and efficient elements will require a large 
value of a. Under optimized conditions the switching power, P, varies 
as 


P«l/F. (23) 


The optimum length of the device varies with FR so that the resonator 
response time, 7, is independent of F. Thus, in this case we find the 
switching energy 


Pr«l/F. (24) 


In Table I, we show recent results from the literature on bistable 
optical devices. We have taken the switching time to be the “recovery 
time” of the device, i.e., the time constant for the return to equilibrium 
in the absence of a driving signal. It is important to note that in many 
cases a device can be switched on (or off) much more rapidly by the 
application of a short, intense driving pulse. The data in Table I shows 
a wide range of switching powers and speeds. As might be expected, 
the fastest devices require the highest switching powers. The lowest 


Table I—Experiments: recent results 
Switching Switching Switching 


Power Time Energy 
(watts) (seconds) (joules) 
Bistable Fabry-Perot resonators 
CS2 3 x 10° 5 x 107" 1.5 x 10°* 
Na vapor 10°? 10° 107’ 
GaAs 2x 107? 4x 10° 8 x 10° 
InSb 10°? <5 x 1077 <5 x 10° 
Hybrid bistable Fabry-Perot resonator 
LiNbO; 10° 5 x 10° 5x10" 
Bistable liquid-crystal matrix 
Hughes liquid crystal light valve 5x 1077 4x10? 2x 10° 
Nonlinear interface 
Glass—CS, 2x 10° 2x 10° 4x 107 
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reported switching energy is for a hybrid, integrated-optical bistable 
device. This low energy is possible because of the very large effective 
nonlinearity created by the electrooptic modulator with electrical 
feedback. Although for some applications long switching times or large 
switching powers are acceptable, in general one wishes to minimize 
each of these parameters. How much could we improve present bi- 
stable devices by shrinking device dimensions to provide shorter transit 
times and higher light intensities for a given input power? 

We have extrapolated current experimental results to A?/n? cross- 
section waveguide devices with a length adjusted for minimum switch- 
ing energy. We have assumed a finesse of 30 for the Fabry-Perot 
resonator and have considered the response time to be the undriven 
recovery time of the device. The results are shown in Fig. 3. It is 
interesting to note that with known devices and materials it should be 
possible to approach rather closely the fundamental limits for optical 
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Fig. 3—The points represent performance limits extrapolated from current experi- 


mental values for waveguide Fabry-Perot resonators containing the polydiacctylene 


UUs AAS 
PTS, the semiconductor GaAs, the semiconductor InSb, and a hybrid device using the 
electrooptic crystal LiNbOs. 
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devices; however, as we see in Table I, present laboratory devices are 
not yet developed to the point where they are close to these limits. 


V. COMPARISON WITH OTHER SWITCHING TECHNOLOGIES 


How do these projected results and limits that we have derived 
compare with those for other switching technologies? In Fig. 4 we 
show how the limits for bistable optical devices compare with the best 
reported values for two well-established switching technologies— 
semiconductor electronic devices, and Josephson devices. It is also 
interesting to see how these devices compare with a biological switch- 
ing device—a neuron. 

It is clear that in the 10~° through 10~"' second region, one cannot 
hope to switch with substantially less power than that required for 
semiconductor electronic devices, and appreciably lower switching 
powers are possible with Josephson technology. In the 107” through 
10~* second region, however, optical devices appear to have no com- 
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Fig. 4—A comparison with the operating range of two other switching technologies: 
semiconductor electronic devices, and Josephson devices. The operating point for a 
biological “switch”—a neuron—is also shown for comparison. 
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petition. This unique capability for sub-picosecond switching is one of 
the most exciting aspects of optical switching technology. 

The switching power required in this short-time region puts the 
operating point well within the “thermal transfer” region discussed 
earlier. For this reason, it does not appear feasible to design a high- 
speed, general-purpose digital optical computer. However, for many 
applications these thermal limits may not present severe problems. 
Two points are worth noting. First, devices using a reactive nonline- 
arity do not depend on the absorption of the incident light. Thus, most 
of the switching power is transmitted by the device and the power 
dissipated is much less than the power required for switching. Second, 
for some applications, fast switching operations are required at rela- 
tively low duty cycles. In both cases the temperature rise in the 
switching elements will be much lower than the maximum value used 
in computing the “thermal transfer” region boundary. 

There are many other factors that relate to the choice of a switching 
technology that cannot be shown on a power-time plot. In many cases 
it is desirable to perform some signal-processing operation on a light 
signal, either because the incoming signal is in the form of light or 
because freedom from electromagnetic interference is desired. Optical 
switching devices typically operate at room temperature. In many 
cases, they have extremely large bandwidths and can be adapted for 
many special functions such as rapid parallel processing of information. 
For these reasons, there will be cases where optical switching systems 
will be used, even in an area of Fig. 4 in which other technologies show 
a switching-energy advantage. 


VI. ILLUSTRATIVE EXAMPLES 


We have shown that because of thermal problems associated with 
the high packing densities required for rapid operation, optical switch- 
ing elements are unlikely to be used as building blocks for a general- 
purpose computer. For certain specific applications, such as the inte- 
grated-optical spectrum analyzer recently developed for microwave 
signal processing, and optical computers for picture processing and 
pattern recognition,”’’ special-purpose optical computers have al- 
ready demonstrated their usefulness. We should also point out that 
fast optical switching devices are opening up a new time region for 
scientific studies, and picosecond spectroscopy is rapidly becoming an 
important field of research.”® 

In this section we will consider a few specific applications of optical 
switching elements, and see where an extrapolation of current tech- 
nology might lead. In many cases nonlinear materials with a suitable 
combination of properties are not currently available. The materials 
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we have chosen for these examples illustrate the wide Tange of prop- 
erties that can be obtained. 


6.1 Low-energy optical switch 


The lowest “switch-off” energy currently demonstrated is 0.5 pJ for 
a hybrid bistable device (see Table I). Extrapolation of this figure with 
a LiNbO; device with minimum waveguide dimensions and assuming 
current detector technology, we find 1 fJ operation should be possible. 
A similar limit is found by extrapolating current figures for InSb 
devices at 5 um wavelength. These figures might be reduced still 
further by using optical resonant structures related to those currently 
being employed for surface-enhanced Raman studies. (See, for exam- 
ple, Ref. 19.) 


6.2 High-speed optical switch 


The highest-speed operation will be obtained with a device utilizing 
a nonlinear material with an electronic nonlinearity. Such nonlineari- 
ties are believed to have response times in the range of 10-4 seconds. 
For minimum device response time, a nonresonant configuration 
should be used. A suitable configuration might be the self-focussing, 
bistable optical switch described in Ref. 20. By focusing the input to a 
spot size on the order of the wavelength of the light at the nonlinear 
material, adequate discrimination between “on” and “off” states could 
be obtained with a device length of 20 wavelengths. For a device using 
as nonlinear material the polydiacetylene PTS” and light of 1 pm 
wavelength, the response time would be 0.1 ps and the peak pulse 
switching power would be 100 W. This power is low enough that it 
might be reached with a mode-locked semiconductor laser diode. 


6.3 4 x 4 optical switching network 


If picosecond speed is not required, an optical distribution network 
could be formed with integrated optics technology on a LiNbOs; sub- 
strate, as demonstrated by Schmidt and Buhl.”! This example is 
somewhat different from the others we have given, in that electrical 
signals are used to control the distribution of the optical signals. 
Kogelnik® has addressed the question of the limits for the stepped AB 
couplers that comprise the switching units. He shows that the limiting 
electrical energy, Esw, needed for one switch of the optical path is 
given by 


Esw-t = 100 pJ X ps, (25) 


where 7 is the transit time through the device. A directional coupler 
switch with a switching time of 110 ps and a transit time of 3 ps has 
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already been demonstrated.” Switching times of 30 ps should be 
possible with 1 ym electrode gaps.” LiNbO; waveguides suffer severe 
problems at optical power levels higher than ~100 pW in the visible 
range. Much higher power levels are possible, however, if near infrared 
light is used. 


6.4 Image amplifier 


An optical image amplifier could be made using an array of bistable 
elements, as shown in Fig. 5a. Each element in the array is a self- 
focussing bistable element similar to that described in Ref. 20. A 
typical output characteristic for each element is shown in Fig. 5b. It 
can be seen that a weak input signal (Zs) produces a strong modulated 
signal at the output when the input light level corresponds to the 
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Fig. 5—An optical image amplifier. (a) Schematic diagram. (b) Operating character- 
istic of each element. 
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“knee” in the characteristic curve. Thus, each spatial element behaves 
as an “optical triode.” 

One possible nonlinear medium for this application would be a liquid 
suspension of sub-micron dielectric particles. As shown in Ref. 23, this 
medium exhibits a large nonlinear coefficient, although it has a slow 
response time on the order of a second. Such times may be acceptable 
for certain image-processing operations. If each element of the input 
image is focussed by the composite lens to a spot size of ~A”, with an 
aqueous suspension of quartz particles one would require an input 
power of 10 mW/resolution element at A = 0.5 ym and 1 mW/resolution 
element at A = 5 ym. Recent calculations” indicate that input powers 
of ~100 nu W/resolution element and a response time of ~1 psec might 
be possible using suitably doped GaAs as the nonlinear medium. The 
verification of these ideas must await further experiments. 


6.5 Optical time-division multiplexer and demultiplexer 


As our final example, let us consider a high-speed optical time- 
division multiplexer (or demultiplexer) that might be used to multiplex 
picosecond optical pulses into a high-capacity optical fiber, and de- 
multiplex the signals at the receiver to obtain low enough bit rates to 
allow handling by optical detectors and subsequent electronic systems. 

A multiplexer can be made from a number of triggerable switching 
elements, as shown in Fig. 6a. A trigger pulse with the proper time 
synchronization is required to multiplex pulses as shown. Each element 
could be made from a properly designed bistable optical device, as 
shown in Fig. 6b. This bistable device consists of a suitable nonlinear 
optical material in a ring resonator. The ring geometry allows separa- 
tion of the inputs and outputs. However, some polarization selectivity 
may have to be employed to avoid interference effects between the 
two input beams J‘), and I’). Let us assume here that pulses in these 
two beams are never present simultaneously in the element. The 
output intensity depends on the total input intensity 79) + I + Irrre, 
as shown in Fig. 6c. If the input pulses are of intensity slightly less 
than the critical intensity corresponding to the “knee” of the curves, 
the output in the absence of a trigger input will consist solely of I. 
However, during the time that a small trigger signal Irric is present, 
the output will consist solely of J. Because of the sharp “knee” in 
the characteristic curves, only a small I7rig is required to accomplish 
this switching. 

In a similar way, one can perform demultiplexing by using an optical 
“tap,” as shown in Fig. 7a. A similar bistable ring resonator serves as 
a triggerable “tap,” as shown in Fig. 7b; the output characteristics are 
shown in Fig. 7c. 

An appropriate nonlinear material for use in these devices might be 
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Fig. 6—Optical time-division multiplexer. (a) Overall schematic diagram. (b) Ring 
triggerable bistable element. (c) Output characteristic of ring bistable element. 


the semiconductor CdS. It has recently been shown” that large non- 
linear effects are found near the biexitonic resonance line. By using 
light at A ~ 4836A, we will obtain a sufficiently large nonlinearity to 
allow the device to operate at a pulse power level of about 1 mW with 
a time response of ~1 ps. This material requires cooling to liquid 
helium temperatures, however, and the wavelength range over which 


this operation can be obtained is small (~5A). A material such as PTS 


ALA 


will avoid these difficulties, but will increase the peak power required 
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Fig. 7—Optical time-division demultiplexer. (a) Overall schematic diagram. (b) Ring 
triggerable bistable element. (c) Output characteristic of ring bistable element. 
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to the 1 W level. As this power is only required for 1 ps, however, the 
operating energy would still be a very reasonable 1 pJ. 


Vil. CONCLUSIONS 


Having identified the physical limits for optical switching devices 
and discussed some specific examples, let us try to draw some general 
conclusions with regard to their future applications. The strong points 
of optical switching devices are: 

(t) Speed: With an electronic nonlinearity or free-carrier genera- 
tion in semiconductors, sub-picosecond switching times are possible. 

(ui) Bandwidth: With a nonresonant bistable optical device or a 
nonlinear interface, a large fraction of the visible light bandwidth can 
be used. 

(vit) Ability to treat directly signals already in the form of light. 

(tv) Capability for parallel processing: With a liquid crystal bistable 
array, image processing has already been demonstrated.” 

The weak points of optical switching devices are: 
(t) High power is required for fast switching. This will tend to 
create thermal problems unless highly transparent materials are used. 
(ii) Materials do not yet exist that have the ideal combination of 
properties for these devices. 

(tit) Theoretical and practical problems involved in waveguide and 
microresonator formation in A* volumes have yet to be overcome. 

(tv) The minimum size of an optical switching element cannot be 
reduced below a volume of about A® unless lossy metallic structures 
are used. 

The field of digital optical switching is a dynamic one in which rapid 
progress is being made. New materials and devices are being proposed 
and studied. Let us close by proposing some areas where future work 
should be directed. 

There is clearly a need for good optical quality, high-nonlinearity 
materials with low absorption coefficients. The development of such 
materials will have a great impact on future applications. Techniques 
must be developed for fabricating optical waveguides and optical 
resonant structures with dimensions on the order of optical wave- 
lengths. For many applications, the problem of regeneration of signals 
associated with ‘fan out’ is important.’ Some types of bistable lasers 
may well find use here. Finally, the general problem of the optics— 
electronics interface (i.e., making optics compatible with electronics) — 
is one that will require a good deal of attention if the most effective 
use is to be made of the tremendous potential of digital optical 
switching. 
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An Inversion Technique for the Laplace 
Transform 


By D. L. JAGERMAN 
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In this paper we use an approximation sequence defined by the 
Widder Laplace transform inversion formula to provide a practical 
method for inverting the Laplace transform. The approximation 
sequence converges uniformly and retains essential structural char- 
acteristics of the original function, e.g., nonnegativity, monotonicity, 
and convexity. Thus, we approximate a distribution function by 
distribution functions and use enhancement techniques to increase 
the speed of convergence and to capture the quality of exponential 
decay. Also, we present a practical computational method illustrated 
by examples. 


|. INTRODUCTION 


The purpose of this paper is to summarize the techniques presented 
in the paper “An Inversion Technique for the Laplace Transform”? 
and to make available a useful reference of properties of the 
‘approximation sequence,’ and a new numerical method developed 
since the publication of Ref. 1. The inversion, or approximation se- 
quence, retains the essential structural characteristics of the original 
function, e.g., nonnegativity, monotonicity, and convexity. Thus, we 
approximate a distribution function by distribution functions. For 
application to queueing theory, this may be considered quite impor- 
tant. The basic inversion sequence, together with error estimates, is 
discussed in Section I]; also, two enhancement procedures are given— 
namely, the construction of a sequence that is more rapidly convergent 
than the approximation sequence and which was not given in Ref. 1, 
and a method of accurate approximation to functions that decay 
exponentially. Section HI discusses the new numerical method, and 
Section IV presents two examples of numerical inversion along with 
controls. Except for the new material whose derivations are given here, 
all proofs can be found in Ref. 1. 
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ll. INVERSION SEQUENCE 


Consider a function f(t) for which the Laplace transform, fis), 
defined by 


fis) = | e-“f(t)dt (1) 
0 


exists for s > c and f(t) = 0(e“)(t > ©); then a sequence of functionals, 
(L,)o, is defined by 





1)" d 
Loft) = flt) =" on SFO) (2) 
One has 
lim falt) = f(0) (3) 


uniformly in every finite closed interval throughout which f(f) is 
continuous. Equations (2) and (8) constitute a variant of Widder’s 
theorem.” The sequence [f,(t)]? is called the “approximation se- 
quence” of f(t). 

The following lists some important properties of f,(¢) valid for ¢ = 0, 
n = 0; a dot indicates differentiation. Assumptions on f(t) are to hold 
in (0, ©), 


(i) asf(t)sbeacfit) sb, 
f.(0+) = f(0+), 
f(0+) = (0+), 

fr(%) = f(%), 
fr(vo) = fi), 


(zz) f(t) monotone => f(t) monotone in the same sense. f(t) com- 
pletely monotone, absolutely monotone, convex, log-convex implies 
the same property, respectively, for f,(¢). 


a = frlt) 
(wi) f(t) = sere 





monotone decreasing in n, f(t) convex = 


f(t) <= f,(t), f(t) = 0 = fa(t) monotone decreasing in n. 
(iv) If f(t)*g(t) = | f (<) g(u) sa (Mellin convolution), then 
0 


f,(t) is the approximation sequence of f(t) = f,(t)*g(t) is the approx- 
imation sequence of f(¢)*g(¢). 


Tha nointwise earrnr «£ (ft ) ic defined hv 
a MAUessscu Vy 


BAA PVASAUTV EY VEY ES LIU iz) 


én(t; f) = frlt) — F(é). (4) 
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One has 
n(t; f) ~ ae = Gil—n 1, —n — 1)t'f(t), n— 0, (5) 
ie L 


in which G,(—n — 1, —n — 1) are Poisson-Charlier polynomials.’ Set 
Bi = Gi(—n — 1, —n — 1); then the following recursion is satisfied: 





l 
Biss = ——[fi + Bi-], Bo = 1, fi =0. (6) 
n+1 
The terms of = (5) through order n~” are 
én(t; f) ~ mare aa ®t) 
: 5 OF Mt) + Oe esp (t)|, mo. (7) 
Lares (n + 1)? n+1 ; 
Some bounds for e,(¢; f) are 
-* as 
nlf; = ? 8 
| en(t Mls aya supli) | (8) 
for pointwise error, and 
lent; f) | < ansup| tf (t) | (9) 
t>0 
uniform over all ¢. The a, satisfies 
an ~ am n— oo, (10) 
™m 


It appears, therefore, that the convergence of f,(t) to f(t) is not 
rapid. In fact, from eq. (7) it is 0(1/n) and is not improved by assuming 
a higher degree of smoothness for f(f). However, one may decrease the 
error, €,(t; f), for a given n in at least two ways by: (1) modifying the 
sequence f,,(t) to obtain a higher convergence rate, and (iz) modifying 
fo(t) to improve its approximation to f(t). The first way may destroy 
the desirable properties of the approximation sequence as previously 
listed because the transformation from f(t) to the members of the new 
sequence may no longer be positive; the second way can be imple- 
mented to preserve all the desirable properties with respect to a 
function different from the original f(t). 

The following procedure improves the convergence rate but does 
not preserve positivity. Define the functionals, o,, by 


ee (2 + *) Oe (: + *) falt); (11) 
n n 


then, from eq. (7) 
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1 
pee SEEN 2) (3) (4 
3 (n + 1)(2n + 1) po) 
id 2n? + 9n+5 

8 (n + 1)?(2n + 1)? 


on(t) — f(t) ~ 


e4f(t), n>, (12) 


and hence 
on(t) — f(t) = O(1/n’). (13) 


Often, f(t) decreases exponentially fast for t — o; the inversion 
method may not capture this effect well for large ¢ if the decrease is 
very rapid. This problem is largely overcome by the following tech- 
nique, which constitutes an example of the second method of improving 
the approximation. 

If f(s) converges for s => —B(B > 0), andO0O <a SB, then f(s — a) 
exists and is the transform of g(t) = e“f (t). If gn(t) is the approxima- 
tion sequence for e“f (¢) and 


fualt) = e“g,(t), (14) 
then 
lim frolt) = f(t). (15) 


The error €n(t; f) = fre(t) — f(t) can be much smaller than e,(t; f), 
corresponding to a = 0, so that a real improvement in accuracy can 
result; of course, 


Enalt; f) = e “en(t; g). (16) 


The approximation /f,,.(f) imitates the rapid exponential decrease of 
f(t) with increasing accuracy as a approaches £. Since g,(t) is an 
approximation sequence, it possesses all of the properties given earlier, 
many of which carry over to f(t). 


lil. NUMERICAL EVALUATION 
A type of generating function may be constructed for [ f(t) ]¢. Let 





Giz, t) = 7(- 7 *) (17) 
and define [a,(t)]é by 
Glz, t) = Y anlt)z"; (18) 
n=0 
then 
/ ¢ \ 
frlt) = an ay (19) 
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. This permits the use of numerical procedures for obtaining coeffi- 
cients in a power series. The following is such a method. The notation 


27x 
eq(x) =e (20) 
will be used. Let 0 < r< 1, g prime and q > n, and define the functional 
S by 


n+1 
t 





12 eg 
Sf= 7 eg(—nyj )f [i real} (21) 


then, in terms of f(t), one has 


ntlig+1 ) r4. (22) 


Sf = frlt) + 2 frsig ( are 


For the purpose of computing f(t), Sfin the form of eq. (21) will be 
used. 

We will now assume that f(t) is bounded so one may take 
|f(t)| = A. Clearly, from eq. (22) and property 1, 





r? 
[Sf— fAlt)| SAG (23) 
and 
lim Sf= frlt). (24) 


Thus, Sf is an accurate approximation to f,(t) when r is small; 
however, if the round-off error of each term of eq. (21) is bounded by 
e, and the total round-off error by 7, then 


n+1 
t 





nse Pr, (25) 

It follows that a choice of r may be made to ensure the round-off 
error, 7, is not too large, in accordance with eq. (25). The parameter q 
may now be chosen to render the truncation error given by eq. (23) 
comparably small. 


IV. EXAMPLES 


A Fortran program for the evaluation of Sf, as shown in Eq. (21), 
was written by Bharat Doshi. Double precision arithmetic was used 
with a round-off error of 107. As a rough estimate, it was assumed 
that the computations required to form each term of the summation 
in eq. (21) resulted in a round-off error of 107"; thus the choice « = 
10-" was made. The choices 
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r= 0.83, q = 127; n = 50, : 
r= 0.91, qg = 251; n= 100 (26) 


lead to about five units error in the sixth decimal; the values of g given 
result in negligible truncation errors. These errors in computing fso and 
fioo were considered acceptably small. 

The function 


f(t) = 1/2e7* + 1/2e~* (27) 


is chosen for the first example. It represents a complementary distri- 
bution function whose values may be accurately computed as a control, 
and which shows the enhancement of accuracy obtainable by use of 
eqs. (11) and (14). Tables I and II illustrate these results. Since 


a 1 1 1 1 
Poel lee 8" v2) 
the choice a = 1 is the appropriate value to use. 

It may be observed, from Table I, that the error is halved in going 
from fso to fioo as expected from the 0(1/n) behavior of eq. (7), while 
the decrease of error from fioo to os0 is better than 1/50 as seen from 
the 0(1/n’) rate of eq. (13). Table II shows that the effect of a is 
greater for larger ¢ since the exponential term that is being tracked | 
becomes dominant. 

The second example concerns the complementary busy-period dis- 
tribution for an M/M/1.‘ The arrival rate is p and the service rate is 
p. = 1. For this case 
t 


dx 
f(t)=1-p"” | e427 (2xV/p) —, (29) 
5 x 
and 
La eee (0 + 1+ s)° — 4p], (30) 
Table I—a = O 
t f foo fico 050 
2 0.06891 0.07203 0.07048 0.06890 
4 0.00916 0.01064 0.00990 0.00915 
8 0.00017 0.00030 0.00023 0.00016 
Table Il—a = 1 
t f foo, foo, 050,1 
2 0.06891 0.06911 0.06901 0.06891 
4 0.00916 0,00916 0.00916 0.00916 
8 0.00017 0.00017 0.00017 0.00017 
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Table Ill—p = 0.9,a =0 


t , feo fico 050 
3 0.2919 0.2942 0.2931 0.2919 
6 0.1950 0.1967 0.1959 0.1950 
9 0.1514 0.1528 0.1521 0.1514 
Table IV—p = 0.5, a = 0 
t f foo fioo 050 
3 0.1803 0.1834 0.1819 0.1803 
6 0.0764 0.0785 0.0775 0.0764 
9 0.0399 0.0415 0.0407 0.0399 
Table V—p = 0.1,a=0 
t FE foo fioo 050 
3 0.0732 0.0774 0.0753 0.0732 
6 0.0088 0.0103 0.0096 0.0088 
9 0.0013 0.0018 0.0016 0.0013 
Table Vi—p = 0.1, a = 0.4675 
t f foo,a fi00,a 050,a 
3 0.0732 0.0742 0.0737 0.0732 
6 0.0088 0.0090 0.0089 0.0088 
9 0.0013 0.0014 0.0014 0.0013 


in which 1,(2xvp) is a modified Bessel function. The evaluation of the 
control was accomplished by using a quadrature procedure. The values 
p = 0.9, 0.5, and 0.1 were used. The best choice of a is, from Section II, 
(1 — Vp); however, when a is small, one could simply set a = 0, since 
there is little improvement in using the best a. For the case p = 0.1, 
one has a = 0.4675; accordingly, a comparison was made with a = 0. 
Tables III through VI sample the results obtained. 


V. CONCLUSION 


It may be observed that the best a for p = 0.1 noticeably improved 
the approximation. The performance of the o,, functionals in all cases 
created a marked improvement in accuracy; however, one should 
remember that the desirable properties of the L, functionals are not 
possessed by the o,. Nonetheless, there is a property of the o,, impor- 
tant in numerical work, which follows from eq. (11). Namely, if 6 is the 
round-off error in the computation of f,(¢) and fon(¢), then (3 + 2/n)6, 
n = 1 bounds the induced round-off error in o, (i.e., round-off errors 
are not significantly magnified). 
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An interesting use of the inversion technique follows. A function f (¢) 
is given in the form 


f(t) - | &(t, x)dx, (31) 
0 


with g(t, x) known; it is required to evaluate f(t) over a wide range of 
values of ¢. Quadrature methods are accurate, for ¢ small, or can be 
designed, for ¢ large, but do not apply equally well over all values of f, 
including the important transition region from small to large. If the 
transform 7 (s) can be obtained, then the inversion method described 
here can be used to obtain sufficiently accurate values of f(t) over the 
entire range of ¢, since eq. (9) shows that the errors, ¢,(¢; f), are 
uniformly bounded. 
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Approximate Mean Waiting Times in Transient 
GI/G/1 Queues 


By D. L. JAGERMAN 
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In this paper we give an approximation method for obtaining the 
probability the server is busy and the mean waiting time as seen by 
the nth arriving customer for the GI/G/1 queueing system. Transient 
behavior is the key issue of the method. The approximation consists 
of a pair of recursion formulae whose state variables are the proba- 
bility of delay and the mean waiting time. Any initial state may be 
prescribed for the Oth arriving customer. Programming is very easy, 
and the computation is rapid. The procedure is useful for rush-hour 
analyses and for studying the recovery of a system from temporary 
overload. 


l. INTRODUCTION 


This paper presents an approximation method for obtaining the 
probability the server is busy and the mean waiting time as seen by 
the nth arriving customer for the GI/G/1 queueing system. Thus, 
transient behavior is the key issue of the method. The approximation 
consists of a simultaneous pair of recursion formulae whose state 
variables are the probability of delay and the mean waiting time. It is 
assumed that the Oth arriving customer finds the queue in some 
prescribed state from which the successive states are computed. Nat- 
urally, for the nth arrival when n is large, the computations provide 
approximations to the corresponding eqilibrium quantities when equi- 
librium exists. The procedure, however, is not limited to queues 
possessing an equilibrium state. For methods specially adapted to 
approximating the equilibrium quantities, we refer to a paper by A. A. 
Fredericks,’ in which the approximation Ao, of that paper essentially 
corresponds to the equilibrium results obtained here; we also refer to 
Fredericks for an application to computer systems.” 

Transient analysis is of particular importance in studying the recov- 
ery of a system from temporary overload. This can occur after a short 
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downtime during which the buffers become full and the machine must 
now work off the accumulated load. Another situation calling for 
transient analysis occurs in the case of minicomputers receiving hourly 
data from an electronic switching system (ESS). The hourly rush of 
traffic requires a transient analysis to determine the buildup and falloff 
of delays. This corresponds to the general problem of rush-hour 
situations. Knowledge of the transient behavior of a queue also indi- 
cates the number of customers needed in the arrival stream until 
approximate equilibrium conditions are recovered from some tempo- 
rary overload (i.e., the relaxation time may be estimated). 

In Section II we obtain the general recursion formulae and the 
corresponding formulae defining the equilibrium results. In Section III 
we specialize the recursions to the GI/M/1 queue and present the 
exact solutions obtained using the Takaés* method as a control. We 
show the relationship between the exact probability of delay and mean 
waiting time, and also conclude that the approximate equilibrium 
quantities are, in fact, exact. We discuss a number of numerical 
examples and compare them with exact results. In Section IV we 
reduce the general recursions to the M/G/1 case. A relation is again 
obtained between the probability of delay and the mean waiting time. 
The interesting property is proved that the approximate probability of 
delay and mean waiting time as seen by the nth arrival satisfies the 
same relation as the exact quantities; thus, the approximation method 
is shown to preserve certain properties of the exact solution. We 
discuss a number of numerical examples and compare them with 
exact results. In Section V we present several numerical examples of 
GI/G/1 queues. Exact solutions, however, are derived only for equi- 
librium values. 

In all the numerical examples given, the queues start empty; how- 
ever, this is done only to facilitate obtaining the controls. Any initial 
state may, of course, be used in the recursion formulae. A synopsis of 
the important recursions and definitions of symbols is given in Section 
VI. 


li. RECURSION FORMULAE 


Let A(x) be the distribution of time between arrivals with mean 
arrival rate A, and B(x) the distribution of service time with mean 
service rate 4; the random variable €, which is service time minus 
interarrival time, has the distribution 


K(x) = [ B(x + y)dA(y). (1) 


a 
"VL 


If W,, is the waiting time of the nth arrival, then we have the recursion 
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W, = [Wr-1 + &)", (2) 
in which 
[x]* = x, x=0 
= 0, x=0 


and the €, are iid with the common distribution K(x). Let W,,(x) be 
the distribution function of W,,; then W,,(s) is defined by 


W,(s) = i e“dW,,(x), (3) 


and hence 
W,.(s) = Ee°™, (4) 


A recursion relating W,,.(s) to W, _1(s) will now be developed. From 
eqs. (2) and (4) we have 


W,(s) = Ee 8 Wa-iten)* (5) 
and hence 
W,(s) = E | eed K(x), (6) 
in which the expectation is over the distribution of W,,_:. Let 
RK(s) = ( e-“dK(x); (7) 
then the functions 
K,(s) = | e dK (x) (8) 
O_ 
and 
OL 
K_(s) = | e-*dK(x) (9) 


will be used in the development. From eq. (6), we find that 


0 
W,.(s) = Wn-1(s)K(s) + | Ee7!n-1t*l dK (x), (10) 


The recursion of eq. (10) is exact but, to obtain a simple, explicit 
recursion, an approximation will be used at this point for the distri- 
bution of W,-1. The more accurate the approximation chosen, the 


GI/G/1 QUEUES 2005 


more accurate the final recursion will be, but also, presumably, the 
more difficult to use; accordingly, a simple exponential approximation 
will be used. 


Let 
On = EW,, (11) 
and 
Jn = 1— W,(0+); (12) 
then we use the approximation 
In 
Wr(x)*1l—me *, x=, 
=0, x<0. (13) 


From eq. (13) we compute 


Jn-1 
J; -1S x 
Bier Wate ae pe a : x<0. (14) 


n-1 
—+s 





Qn-1 
Thus, eqs. (10) and (14) yield the recursion 
ee m us ys ye a 2 
W,.(s) = Wn-1(s)K+(s) + K-(0) ae, (- Ze), (15) 


n—-1 Qn-1 
—-+s 
Qn-1 


A pair of recursion relations will now be obtained for Jn, an. Since 





lim W,(s) = 1— dn, (16) 
—W1.(0) = an, (17) 

and 
lim K.(s) = 0, (18) 

we obtain from eq. (15) 
In = Ry(0) + Ina K- (- Zt), (19) 
n—-1 
and 
‘> ‘> In-1 >) 
an = | K+(0) + K-[ - . Qn-1 — K',(0). (20) 
n—1 


Equations (19) and (20) constitute the approximate recursions sought. 
If the queueing system possesses equilibrium values, then approxi- 
mations designated by J, a, W(s) are obtained from 


w\ 


J=&,(0) + IK_ (= a) (21) 
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a= [£00 as R.(- 2) a — K‘,(0), (22) 
and 


W(s) = 





R_() - = RK (- 2) (23) 
a +s 


1 
b= K,(s) 


The recursions of eqs. (15), (19), and (20), and all subsequent 
recursions derived from them, apply to arbitrary initial conditions and 
to stable or unstable queues. 


lll. GI/M/1 QUEUEING SYSTEM 


To ascertain the quality of the approximations obtained by eqs. (19) 
and (20), a control is needed. This will be provided by the explicit 
solution formulated by Takaés® for the GI/G/1 system. Accordingly, 
let 


Glz, s) = F Wals)z", (24) 
n=0 
A(s) = i e“dA(x), (25) 
0 
and 
B(s) = e *dB(x); (26) 
0 
then 
R(s) = A(-s)B(s), (27) 


and the factorization 
1 — 2K(s) = T(z, s)T_(z, s) (28) 


defines the functions I'(z, s), _(z, s). The function I;(z, s) is to be 
analytic for Res > 0 and not to vanish for R.s = 0, while I'_(z, s) is to 
be analytic for R.s < 0 and not to vanish for R.s <= 0. Further, define 
the projection operator T by 


TEe*" = Ee™*"", (29) 


in which 7 is an arbitrarily given random variable. Then the exact 
solution for the generating function, G(x, s), is 





- 1 Wo(s) 
G(z, s) => Ty(z, 8) fu aS | (30) 
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The only exact solutions to be studied in this paper will correspond 
to the queueing system starting empty; hence, Wo(s) = 1. For this case 
the projection of eq. (80) may be evaluated; we find that 


1 











Oe, 9) =e Fae, 0) Tn 
To apply eq. (31) to the GI/M/1, we consider 
R(s) = A(—s) (32) 
Ets 
and 
1 — 2K(s) = p+ 8 — NzA(—s) (33) 
uUts 
Let 6(z) > 0 be defined by 
p — 5 — pzA(5) = 0; (34) 
then we find that 
re _ b+snt+s—pzA(-s) 
1 z2K(s) = fi rigs aor Weide, = (35) 
Thus, 
Pes (36) 
Bt+s 
ofthe uzA(—s) 
T_(z, s) = See ; (37) 
and 
1 — 
T_(z, 0) = aioe (38) 
Hence, from eq. (31), we obtain 
6 puts 
G(z, Ss) = w= 2) 648 (39) 


Let Jn, Gn designate the exact values of 1 — W,(0+), and EW,, 
respectively, and let 


J(z) = dy Jnz", (40) 
| a(z) = y Giz": (41) 


n 
mv 


then, from 
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1 
J(z) = tos” G(z, ©), (42) 
Se 


0 
a(z) =— ae G(z, 8) |, (43) 
and eq. (39), we get 
0 aa (44) 
1-—z 
and 
-1_ ,-1 
=< (45) 
1-—z 


Let Ln =Jn —Jn-1, An = Qn — Gn-1 and L(z), A(z) be the corresponding 
generating functions; then elimination of 6 in eqs. (44) and (45) yields 








1 L(z) 
A(z) = -————— 
OF ae (46) 
and 
_1 sz) 
a(z) = a TAY (47) 
In particular eq. (47) implies the recursion relating j,,, an, as follows: 
1, 3a, 
An = —Jn+ Y, An-r( Jr — Jr). (48) 
B k=1 
From eq. (32), we find that 
A = BL A 
Ky(s) = ia A(y), (49) 
4 BL a a 
Kyo! pen ee 
(s) ear [A(—s) — A(u)], (50) 
and 
a x A 1 « 
Ki(0)=A(u), K4(0)=- Gone (51) 
Hence, eqs. (19) and (20) specialized to the GI/M/1 are 
~ In-1 “ In-1 a~ 
In = A(u) + Slee ds B (=) = Aw | (52) 
LL &n-1 
and 
me 1 Poe fe) kee ~ 1 « 
Qn = ) A(u) +——_— lA ( ) = Ayo | Qn-1+—-A(p). (53) 
= 1 In-1 Qn-1 LE 


EL Qn-1 
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We will now show that the limiting forms of J,, an, that is J, a are 
exact; thus J = j, a = a. Equations (52) and (53) show that 








és J é me 
J= A) +— 5 E (2) = Aw | (54) 
poe: a 
ba 
and 
A A J A 1 A 
a=; A(u) + iJ [A (2) - 4 | at—A(p). (55) 
el 7 
wa 
Hence, 
A (2) — A(p) 
Qa 1 a 
ba 
and, from eqs. (55) and (56), 
1 J 
a= re eae (57) 
and 
J = A[p(1 — J)]. (58) 


Since eqs. (57) and (58) are the exact relations defining j, a, the 
statement is proved. 

An M/M/1 will be used to start the numerical examples. For the 
M/M/1, one has (p = A/p) 


1 Jn 
J, =——— | p +————_— (59) 
ee 1+ 
N &n-1 
and 
1 ‘i 1 i p 
hh = p -————— J] @r-1 T- 1; 
1+ 1 
p 1+ x En-1 H 


in which the designation g, = J,/an will henceforth be used. 
Since Jo = 0, ao = 0, the computations are started with 
p 


Tota 
- t+ PY 


J, = K,(0) = ; (60) 





and 
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Table I—M/M/1, p = 0.2. 


n J a J a 
1 0.1667 0.1667 0.1667 0.1667 
2 0.1898 0.2176 0.1898 0.2176 
3 0.1962 0.2368 0.1962 0.2364 
4 0.1985 0.2445 0.1985 0.2440 
5 0.1994 0.2477 0.1993 0.2473 
Table II—M/M/1, p = 0.8. 
n J a J a 
1 0.4444 0.4444 0.4444 0.4444 
2 0.5542 0.7517 0.5542 0.7517 
3 0.6047 0.9959 0.6084 0.9912 
4 0.6354 1.2016 0.6418 1.1890 
5 0.6570 1.3804 0.6650 1.3578 
Table III—M/M/1, p = 2.0. 
n J a J a 
1 0.6667 0.6667 0.6667 0.6667 
2 0.8148 1.2593 0.8148 1.2593 
3 0.8719 1.8233 0.8807 1.8189 
4 0.9012 2.3727 0.9172 2.3603 
5 0.9191 2.9131 0.9362 2.8922 
1 p 
a, = —K+(0) = -—__., (61) 
w1+p 


which are exact. The value p» = 1 will be used in all examples. Tables 
I through III below present approximate and exact values for p = 0.2, 
0.8, and 2.0. 

For the next example, the renewal stream is 


1 


A(s) =—_—_-___-. 62 
(9) Ta + 25) (62) 
The recursion relations are 
1 5 + 22n-1 
Jn = =} 1 + dn-1 ———_—_—_ |, 63 
6 'G+e00+ =| 7 
and 
1 5 + 2gn-1 1 
n= 1 - n— +-. 64 
os iG | (1+ gna)(1 + bec oer’ a 
Some numerical results are given in Table IV. 
As a further example the case D/M/1 is considered. We have 
A(s) = e7°7. (65) 


The recursion relations are 
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Table IV—GI/M/1, p = 1/3. 


n J a J a 
1 0.1667 0.1667 0.1667 0.1667 
2 0.1991 0.2269 0.1991 0.2269 
3 0.2100 0.2538 0.2101 0.2534 
4 0.2147 0.2670 0.2148 0.2662 
00 0.2192 0.2808 0.2192 0.2808 
Table V—D/M/1, T = 2, p = 0.5. 
n J a J a 
J 0.1353 0.1353 0.1353 0.1353 
2 0.1720 0.1903 0.1720 0.1903 
3 0.1867 0.2179 0.1868 0.2175 
4 0.1938 0.2331 0.1940 0.2324 
5 0.1977 0.2419 0.1978 0.2410 
00 0.2032 0.2550 0.2032 0.2550 
—Tgn-) -T 
_ e —e 
In = eT + Jn1———__, (66) 
1 ri En-1 
and 
—Tgn-1 -T 
2 e —e = 
On = | e7? +——————_] an 1 + 7”. (67) 
Le 8n-1 
We find that the exact result for /, is 
n RF} 
Jn = my ee YA meee (68) 
k= Rt 


The equilibrium values for T = 2 are J = j = 0.2032, a = a = 0.2550. 
Table V presents the numerical results. 


V. M/G/1 QUEUEING SYSTEM 
For the M/G/1 queue, we have 


Rs) => Bs); (69) 
A~-Ss 
hence 
1 — zK(s) A= ~ AzB(s) (70) 
A-Ss 
Define 85(z) > 0 by 
A — 8 —AzB(s) = 0; (71) 


then 
6—sA—s-—AzB(s) 


A-s 5—s ‘ i) 


1-— zK(s) = 
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and hence 





me yee ew. 

56-—s 

5-—s 

T(z, 8) => —, 

and 

6 
T_(z, 0) =-—. 

(z, 0) x 


Accordingly, the generating function, G(z, s), is 
G(z, s) = fe SS os, 
5X — s —AzB(s) 


thus, the generating functions /(z), a(z) are 








ae A 
Na es 
and 
1 1 l11-2zp 
ate) == (F-F =) 


Elimination of 6 in eqs. (77) and (78) yields 


This may be compared with eq. (48) for the GI/M/1. 
From eq. (69), we have 


2. A. 

K_(s) =7_, BO), 

R,(s) im — [B(s) = B(a)], 
—-S& 


K,(0) =1— Ba), 
and 


“ 1 a 1 
K*(0) = x A - BO) — i 
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(73) 


(74) 


(75) 


(76) 


(77) 


(78) 


(79) 


(80) 


(81) 


(82) 


(83) 


(84) 


(85) 


2013 


Thus, the recursion formulae eqs. (19) and (20) become, for M/G/1, 


a B 
J, 24 SRO Ji, (86) 
1 + yor 
” B 1 i 
On = pai on Roi: (87) 
1 hooA 


Insofar as approximations imitate characteristics of an original, we 
may better apply the approximations. We will now show that the 
approximations J, a,, satisfy eq. (80). From eq. (86) we find that 

=aBO)n_ sei Tesedy, as 
n-1 


1+ : 
yea 


1 dJnaldn -—1+ BD] 


Qn-1 = 7 


XTnaBa) J, +1 — BO’ = 
-and, from eq. (87), 
Jn-1B(d) — Jn + 1 — BCA) 

In-1 (90) 


{4 . 
ana tT [1 — BO], 


Qn — An-1 = 


Thus, substitution of a,-1 from eq. (89) into the dexter of eq. (90) yields 


ea - = 5d (91) 
and 
Pe ee (92) 
L A k=0 


From eqs. (81) and (92), we have 
1S eg 
An — An = —A0 + Jo —- = Y (Je — Az); (93) 
DN p=1 


hence, for a stable queue, 
Je—~ In 9, k— 0, (94) 


Since j, — p, then also J; — p, k > ~». Unlike GI/M/1, however, a # 
a. The equality that occurs in GI/M/1 was to be expected since the 


atin alaatedhiits ™ 
waiting-time distribution was approximated by an exponential in a 


portion of the integration producing eq. (15), and the exact equilibrium 
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distribution is, in fact, exponential. This does not occur in M/G/1, 
however, since the waiting time is not exponential. The value for a is 
simply 
AB” (0) 
a=— 


95 
S1-,p (95) 





while solution of the equilibrium form of eqs. (86) and (87) shows that 


1 p-1+B(r 
goo ee. (96) 
B (1 — p)[1 — BA)] 
thus, a = a for small A. 
For the first numerical example, consider 





Age =, 842-2 (97) 
pts 1 
(: + 5 s) 
then 
age (t+ 0+ 5 (98) 
(2 + p)” p+ 8n-1 





an = 


p 4 1 4 
4+ 9p +—— ] an -1 + 1 —-{ 1 -—— }.__ (99 
ohoel 4 A) ons * oy v2) 
We have 


Ji 


a =1 ie 


~ (2+ p)?’ pp (2 + p) 


for the queue starting empty. Table VI presents some numerical values 
for p = 0.5. 
As another example let us consider the following M/D/1: 


_ (4 +p) “| 4 (100) 


a 1 Pe 
A(s) =——_— =e)". 
(s) 14 oe B(s) =e (101) 
The recursion formulae are 
ia e l/2 
ne a ae oy Se eee 102 
: Jott Og ee 
and 
e7W2 
n= (1—e7? + ——___] ana + 267? = 1. 
a ( e i —) Qn-1 e 1 (103) 


The values are summarized in Table VII below. 


V. GI/G/1 QUEUEING SYSTEM 
For the next group of examples, the control will provide only 
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equilibrium values. The following uses an interrupted Poisson arrival 


stream‘ defined by 


ave 3.004s + 0.913216 
s’ + 4.3085 + 0.913216" 
The service time distribution is given by 


3+ 5s 


Bis) =F 9s(8 + 28) 
Thus, 
Rs) - (0.913216 — 3.004s)(3 + 5s) 
(0.223586 — s)(4.084414 — s)(1 + 2s)(3 + 2s)’ 
va (ie 0.0516472 i 0.672764 
0.228586 —s 4.084414 —s 
- 0.182021 1.266801 
ea ee ar Te 
and 


K.(0) = 0.604288,  K%.(0) = —0.645553. 


The recursion formulae are 


pies 0 004288 Pdi a oaran acs aa ae 
Jn = 0.60 (casass +Bn1 4.084414 + gn. 


and 


On = (o.sosass ,__0.0516472 0.672764 


Ce 
0.223586 + gn-1 4.084414 + gp-1 
*Qn-1 + 0.645553. 


Table VI—M/G/1, p = 0.5. 


n J a J a 

1 0.3600 0.2800 0.3600 0.2800 
2 0.4245 0.4310 0.4266 0.4269 
3 0.4515 0.5280 0.4547 0.5174 
4 0.4666 0.5948 0.4698 0.5777 
5 0.4762 0.6423 0.4789 0.6198 
co 0.5000 0.7778 0.5000 0.7500 

Table VII—M/D/1, p = 0.5. 

n J a J a 

1 0.3935 0.2131 0.3935 0.2130 
2 0.4443 0.3244 0.4482 0.3166 
3 0.4655 0.3933 0.4701 0.3764 
4 0.4773 0.4387 0.4812 0.4140 
5 0.4846 0.4654 0.4876 0.4388 
00 0.5000 0.5415 0.5000 0.5000 
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0.0516472 0.672764 ) 


(104) 


(105) 


(106) 


(107) 


(108) 


(109) 


(110) 


(111) 


Table VIII—G//G/1, interrupted 
Poisson, p = 0.7. 


n J a 

1 0.6043 0.6456 
2 0.7123 1.1509 
3 0.7498 1.5762 
4 0.7703 1.9470 
5 0.7842 2.2769 
6 0.7947 2.5743 
7 0.8031 2.8452 
oo 0.8824 8.0160 


Table IX—Approximate and 
exact waiting-time distributions. 


x w(x) = w(x) = 
0 0.1176 0.1183 
1 0.2190 0.2212 
2 0.3014 0.3026 
3 0.3728 0.3717 
4 0.4363 0.4323 
5 0.4933 0.4863 
6 0.5448 0.5350 
7 0.5911 0.5789 
8 0.6330 0.6187 
9 0.6706 0.6547 

10 0.7045 0.6872 


Table VIII presents the numerical results. The exact equilibrium 
values are J = 0.8817, a = 8.547. 

We will use this example to illustrate eq. (23) for the approximate 
distribution function. Using the approximate equilibrium values ¢/, a, 
as in Table VIII, we find that 


w(x) = 1 — 0.894602e~°71* + 0.0439472e 090819 (112) 
— 0.03174938e 0944", 
The exact waiting-time distribution is 
w(x) = 1 — 0.8420579e~ °° — 0.03963069e °°°°979*_ (113) 


The numerical comparison is given in Table IX. 
As another GI/G/1 example, we consider 
1 


a ear TRY BAD 


(114) 


and 
B(s) = Yes + Ye} 3s, (115) 
Here the service consists of a step function with two values. The 


recursive equations are 
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1.226734 0.384559 
In = 0.157825 + In-1 ( ) (116) 


4 9p.) Tas 


and 


1.226734 0.384559 
On = (0.57625 + Qn-1 + 0.068909. (117) 


12545. 14 Pes 


We show the numerical values in Table X. The exact equilibrium 
values are j = 0.1798, a = 0.09266. 


Table X—G//G/1, two-step 
service time distribution, p = 


1/3. 

n J a 

1 0.1578 0.06891 
2 0.1741 0.08688 
3 0.1782 0.09278 
4 0.1795 0.09485 
5 0.1800 0.09558 
00 0.1802 0.0960 


For the last example, a D/G/1 will be considered. Let 


A(s)=e", Bs) = oH (118) 
(: + 3 5) 
then 
” sT os —2T 2 —2T 
R(s) = a R.(s) “7 ma — (119) 
(1+58) (+55) rs 
and 
R_(s) = R(s) — K.(s). (120) 
The recursion equations are 
Sn = e77(1 + 2T) + In-1R-(—Bn-1), (121) 
and 


On = [e27(1 + 27) + KR-(—gn-) Jon te77(1+ 7). (122) 


Tables XI through XIII show values for T = 2, 10/9, and 0.5, respec- 
tively. | 

To obtain exact equilibrium values for the stable queues, define the 
roots 6)(z), d2(z) by 


1 — %8,(z) = Vze7/27&@), (123) 
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Table XI—D/G/1, t= 2, p= 


0.5. 

n J a 

1 0.0916 0.05495 
2 0.1085 0.07016 
3 0.1136 0.07561 
4 0.1154 0.07775 
5 0.1162 0.07863 
oo 0.1167 0.07927 


Table XII—D/G/1, T = 10/9, p 


= 0.9. 

n J a 

1 0.3492 0.2288 
2 0.4614 0.3822 
3 0.5185 0.5025 
4 0.5545 0.6032 
5 0.5801 0.6906 
for) 0.7590 2.0631 


Table XIII—D/G/1, T= 0.5, p= 


2. 
n J a 
1 0.7358 0.5518 
2 0.8875 1.0716 
3 0.9362 1.5822 
4 0.9566 2.0893 
5 0.9672 2.5945 
10 0.9850 5.1101 
20 0.9928 10.1252 
WSo(z) — 1 = Vze7 2TH). (124) 
then 
; 1 1 
J (z) = —— } 1 — — 81(z)62(z) |, (125) 
1l-—z 4 
and 
1 -1 -1 
a(z) = ee 6:(z)~" + é2(z)-" — 14. (126) 
—z 
Thus, the equilibrium values are 
J=1-— %61(1)62(1), a = 6,(1)7' + 62(1)7? — 1; (127) 


and, hence, for T = 2, we have j = 0.1164, a = 0.07842, and for T = 
10/9, 7 = 0.7587, a = 1.9895. 
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Vi. SYNOPSIS 


We list the recursions and definitions of symbols here for ready 
reference. 


6.1 Definitions 


A(x): interarrival time distribution. 
B(x): service time distribution. 
W,.(x): waiting-time distribution of nth arrival. 


K(x) = [ B(x + y)dA(y). 
if) 


A: mean arrival rate. 
jt. mean service rate. 
p = 


A/p. 


K(s) -| e~"*dK(x). 
R,(s) = | e~*dK(x). 
0 
0O_ 

RK (s) = i e-*dK(x). 
A(s) = i e-*dA(x). 
0 


B(s) = | e~*dB(x). 
0 


W,(s) = | e~*dW,,(x). 
0 


Jn: probability nth arrival sees server is busy. 
Jn: approximate evaluation of jn. 

a,: mean waiting time of nth arrival. 

Q,: approximate evaluation of ap. 


En = J; nl An. 
All symbols without the subscript n designate equilibrium values. 
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6.2 Recursion formulae 
6.2.1 General—Transient 


Py Pe = RK,.(0) + In-1K(—Zn-1), 
dn = [K.(0) + R_(—gn-1) Jon—1 — K4(0), 


and 


s 


W,(s) = Wn-1(s)Ki(s) + R_(0) — In R_(—Bn-1). 





s n—-1 
6.2.2 General—Equilibrium 
J = K,(0) + JR_(-g), 
a = [K+(0) + K-(-g)]a — K4(0), 


and 
an | £10) - SR (3) 
1- K,(s) 7 St+8 all 
6.2.3 GI/M/1 
m Spee a se 
Jn = A(p) + Ale ae A(p)], 
1 = — 8n-1 
Ll 
x iy “ a Lise 
On = | A(u) + i [A(gn-1) — A(u)] | an1 + —A(y), 
pe 
1 — — 2n-1 
Ll 
J =A[p(l—J)], 
and 
_1l Jd 
an p1il-Jd 
6.2.4 M/G/1 
In=1— BQ) + Ina 
1+ oe 
ig Br 1 1 2 
an= | 1— B(A) je Qn-1 + 7-70 — B(A)], 
1+ yon) B 


J=j=p, 
1 p—1+B(d) 


#(1—p){1- BA)]’ 
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and 
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An Analysis of the Carrier-Sense Multiple- 
Access Protocol 


By D. P. HEYMAN 
(Manuscript received February 16, 1982) 


In this paper we analyze the throughput and delay characteristics 
of the Carrier-Sense Multiple-Access protocol with a queueing model. 
The effects of finite buffer size, bursty arrivals, and collision detection 
with exponential rescheduling are examined. 

The Carrier-Sense Multiple-Access protocol could be used on a bus 
in a packet switch. It works by sending a signal to all sources when 
the bus is occupied. A source postpones transmission to the bus when 
this signal is heard. Since the signal takes a positive amount of time 
to reach the sources, two or more sources occasionally will attempt to 
use the bus at about the same time. When this occurs, all the packets 
are destroyed and rescheduled. 

We conclude that: (i) it is important to choose the mean resched- 
uling time correctly, and (ii) performance degrades significantly 
when compound Poisson arrivals (peaked traffic) replace Poisson 
arrivals (smooth traffic). 


I. INTRODUCTION 


The Carrier-Sense Multiple-Access (CSMA) protocol has been pro- 
posed for resolving conflicts when several sources attempt to use a 
single channel. In this paper we investigate the throughput and delay 
characteristics of CSMA. 


1.1 Background 


When several sources attempt to use a single channel, a protocol (in 
this context, protocol is synonymous with queue discipline) is required 
to allocate the channel among the sources. When the channel is 
occupied, a signal is sent indicating that state of affairs. When a source 
wants to use the channel, it first listens for this signal. If the signal is 
not heard, the source starts transmitting. If the signal is heard, the 


2023 


source postpones transmitting and tries again at another time. The 
advantage of this system is that a device to control the sources is not 
required. The disadvantage is that occasionally messages will be de- 
stroyed because two sources will transmit at the same time. This is a 
consequence of the fact that signals travel at finite speeds, so there is 
a delay between the epoch when one source seizes an idle channel and 
the epochs when the other sources can first hear the busy-channel 
signal. Thus, soon after a source seizes the channel, another source 
may sense that the channel is free even though the channel is busy. 
When this occurs, both messages are destroyed because their bits have 
been merged. At the end (earlier if collision detection is used) of each 
transmission, the source determines whether the transmission was 
successful. If it was, the source goes about its business; if it was not, 
each message is rescheduled as if the channel were sensed as busy. 

The csMa protocol is used in the Ethernet* local-area distribution 
system and has been considered for part of a digital switch. Since CSMA 
is envisioned as a protocol for packet networks, we will refer to the 
arrivals as packets and the channel as a bus. 


1.2 Relation to other work 


The first study of csMA is Kleinrock and Tobagi.' In addition to the 
version of CSMA studied here, which they call nonpersistent csMA, they 
investigated various forms of persistent CSMA and slotted csMA that 
are not alluded to in this paper. Their paper implicitly assumes that 
the rescheduling delay is infinite. This means that packets which find 
the bus occupied or which are destroyed in a collision are abandoned. 
For the most part, we assume, as their paper does, that packets arrive 
according to a Poisson process. In a subsequent paper by Kleinrock 
and Tobagi,” the rescheduling times are modelled as geometrically 
distributed random variables. They used a finite number of sources 
and assumed the arrivals to be quasi-random (i.e., finite-source Pois- 
son). Also, they introduced an unnatural discretization of the time 
scale, which causes some small distortions in the results. A continuous 
version of this model is described and solved in Halfin.* Our analysis 
shares many features with the analysis found in Ref. 2, particularly 
the exploitation of the regeneration epochs. 

Tobagi and Hunt* add collision detection to the model of Kleinrock 
and Tobagi.” Collision detection is a feature that informs a source that 
its packet has been destroyed soon after the collision occurs. Our 
model of collision detection is based on the model presented in Ref. 4. 

Rappaport’ and Rappaport and Bose® describe an elaborate model 
of a protocol similar to CSMA. 


* Ethernet is a trademark of Xerox Corporation. 
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1.3 Summary of results 


The models considered in this paper can be easily solved numeri- 
cally. The following conclusions were reached by considering many 
numerical examples. 

(t) It is important to choose the mean rescheduling time correctly; 
performance is not sensitive to changes in this number near its best 
value. 

(it) If the mean rescheduling time is based on a designed load but 
the realized load is different, the realized performance is almost as 
good as if the mean rescheduling time were based on the realized load. 

(tit) When the traffic intensity is no larger than 0.7, the throughput 
is insignificantly lower than the traffic intensity. 

(tv) Performance is significantly degraded when compound Poisson 
arrivals (peaked traffic) replace Poisson arrivals (smooth traffic). 

(v) When the traffic intensity is no larger than 0.7, collision detec- 
tion lowers the average waiting time significantly and reduces the 
sensitivity of the throughput to the mean rescheduling time. For higher 
traffic intensities, collision detection significantly increases the 
throughput and decreases the average delay. 


1.4 Outline of this paper 


Our model is described in detail in Section IJ, and the method of 
solution is outlined. The details of the solution are presented in Section 
III. Numerical examples and empirical conclusions from the model are 
given in Section IV. In Section V, we consider the effects of bursty 
traffic by introducing compound Poisson arrivals. We return to Poisson 
arrivals in Section VI and examine the benefits of collision detection. 
Appendix A records the transition probabilities omitted from the text, 
and Appendix B lists the most important symbols used in the text. 


ll. MODEL AND OVERVIEW OF THE SOLUTION METHOD 


Our setting is a queue of the M/D/1/K type. Let A be the rate of the 
Poisson arrival process. There is a single server with a constant service 
time per packet. At most K packets may be in the system (queue plus 
server) at any time. Packets that arrive when the system is full are lost 
forever and have no effect on the system. Our use of constant service 
times reflects the assumption that all packets have the same length. 

The parameter K has two potential interpretations. One is to suppose 
there is a buffer that can hold K — 1 packets, so the maximum number 
of packets in the queue is K — 1. The other is to suppose there are K 
ports and a packet that cannot seize a port is lost. We will show that 
proper limiting probabilities will not exist without this bound on the 
number of packets in the system. 
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When a packet is on the bus (i.e., when it enters service), a signal is 
sent that warns other packets that the bus is occupied. (Equivalently, 
a signal indicating the bus is free is turned off when a packet is on the 
bus.) Let h denote the one-way propagation delay for this signal. It is 
common to all potential users. The propagation time between a source 
and the bus depends on the distance between them. If we calculate h 
for the source that is farthest from the bus, the assumption that A, the 
one-way propagation delay, is common to all users will underestimate 
the performance measures. This propagation delay has two effects. 
The first effect is that for the first h time units after a packet occupies 
a previously idle bus, other packets that arrive and want to use the 
bus will do so because the signal that the bus is occupied has not 
reached them. When this happens, we say that a “collision” has 
occurred and the first A time units of a service time are called the 
“vulnerable” period. The second effect is that the signal that the bus 
is free is not received until the bus has been free for h time units. 

When a collision occurs, the bits in both packets are scrambled, and 
so both packets must be retransmitted. We assume that the collision 
is detected at the end of the unsuccessful transmission. Then each 
source involved in the collision draws a number from an exponential 
distribution with mean 1/a, which is the length of time until the source 
next attempts to use the bus. Furthermore, all sources that attempt to 
transmit a packet and hear the signal that the bus is occupied make 
their next attempt in this fashion. We call a the “retry rate.” 

Our objective is to express the throughput (equivalently, the asymp- 
totic departure rate or the asymptotic proportion of time that the bus 
is successfully transmitting packets) and average waiting time of a 
packet (i.e., the departure time minus the arrival time) to the system 
parameters A, K, h, and a. 


2.1 Outline of solution 


Choose time units so that the processing time of a packet is unity. 
Consider a packet, #, that gets on an empty bus at time ¢. This pac- 
ket will relinquish the server at ¢ + 1 and the signal that F is on the 
bus will cease being sensed at some random time 7 witht+1+hs 
7 <=t+1+ 2h. If no packets arrive during the vulnerable period, then 
tr=t+1+4+ 2h. 

Suppose we assume that 7 holds the bus for a (nonrandom) length 
of time vp, with 1 = y = 1 + 2h, at the end of which 7 relinquishes the 
server and the signal that 7 is on the bus ceases. Retries that occur 
during the vulnerable period destroy the messages involved in the 
collision. These retries are not considered to have seized the bus. 

When p = i, these assumptions wili provide an upper bound for the 
throughput and a lower bound for the average number of customers 
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present (in a buffer or on the bus) in the steady state. When pv = 1 + 
2h, a lower bound for the throughput and an upper bound for the 
average number of customers present is obtained. Little’s theorem (the 
queueing formula L = AW) implies that setting » = 1 provides an 
upper bound for the average waiting time and that setting vy = 1 + 2h 
provides a lower bound. Therefore, we will solve a model with constant 
service times p. 

Since h is typically much smaller than one (we use h = 0.01 in our 
examples), we expect that the bounds will be close together. That is 
the case in our numerical examples. Therefore, either of the bounds is 
a good approximation to the true measure of performance. 

Let X(t) be the number of packets present at time ¢. We want to 
obtain 


pi = lim P{X(f) = 1} (1) 
t->00 
for each i = 1, 2, ---, K. These “limiting probabilities” are easily 


shown to be independent of the distribution of the initial state X(0). 
To obtain {pi}, we embed a discrete Markov chain at customer 
“ejection” epochs, which are defined as follows. Let C,, be the epoch 
where the mth packet to get on an empty bus leaves the bus. This may 
be the end of a successful transmission where the packet leaves the 
system or it may be the end of a destroyed transmission where the 
packet rejoins the queue. To encompass both types of events, we call 
C,, the mth ejection epoch. 

Let Yn = X(Cn) A limsy X(C,, + s); it represents the number of 
customers present just after C,,. Since the arrivals are Poisson and 
retrials are governed by an exponential distribution, it is easy to see 
that {Yn;m = 0,1, --+ } is a Markov chain. It is also easy to see that 
this chain is irreducible and aperiodic (details are omitted) so that the 
limits 

vm A lim P{Y, = 1}, i=0,1,---,K, 
exist and are independent of Yo, are positive, and sum to one. 

Three steps are used to obtain { pi} from {7;}. The first step is to 

relate {7;} to {pi} where 


p: A lim P{Y,, = i and transmission is successful}, 
no 


t=0,1,---,K—1. (2) 


The relationship is obtained by calculating the state-dependent prob- 
ability of having a collision. The second step is to use a “rate up equals 
rate down” argument to show 


Api = Spi, 1=0,1,---,K-1, (3) 
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where ¢ is the ejection rate. The third step is to obtain po by a renewal- 
reward argument. Then eq. (3) is used to calculate ¢ and the remaining 


pis. 


lll. SOLVING THE MODEL 


In this section we give details of the solution of the model described 
in Section II. 


3.1 Collision probabilities 
Let c, be the probability that n packets arrive in an interval of 
length, v; then 


: n=0,1,---. 


n 


= e 2 


n 


A well-known property of Poisson arrival processes (e.g., Corollary 5- 
13 in Heyman and Sobel’) is that the arrival epochs are iid and uniform 
over [0, v] when it is given that n > 0 arrivals occurred during [0, v]. 
Thus, for a service interval that starts with i packets in the queue, we 
have 


mn & P{no arrivals prior to h|n arrivals in a 
service interval that starts with i in queue} 





-(—). n= 0,1, --- and any. (4) 


v 
Use the memoryless property of the exponential distribution to obtain 


6; 4 P{no retries prior to A|n arrivals in a 
service that starts with i in queue} 


—tah 


=e ™, t=0,1,--- andanyn. (5) 


Since a collision is avoided if, and only if, there are no arrivals and no 
retries during the vulnerable period, we have 


d,(i) 4 P{no collision and n arrivals in a service 
interval|start with i in queue} 


= mCni, l, n = 0, (6) 
and 


d,(it) 4 P{collision and n arrivals in a service 
interval|start with z in queue} 


= (1 — mdi) en, 1, n = O. (7) 
Equations (6) and (7) are used to calculate the transition probabilities 
of the embedded Markov chain {Y,,; m = 0, 1, --> }. 
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3.2 Transition probabilities 


Recall from Section 2.1 that Y,, is the number of packets in the 
system just after the mth ejection epoch. All of these packets must be 
in the queue because of the propagation delay in broadcasting that the 
bus is available. Let 


Py & P{Ym+i =J| Ym = 1}. (8) 


In this section we present formulas for computing p,, 0 < 1,7 = K. It 
will be convenient to call both exogenous packet arrivals and retries 
“arrivals”; the former are “outside” arrivals. Then, for 2<i=s K—-1 
and0Q<it+tn<=K — 2, we may write 


Diitn = P{next arrival is from outside|i in queue} 
x [P{n outside arrivals during service and no collision|service 
starts with i in queue} 
+ P{n— 1 outside arrivals during service and a collision|service 
starts with 7 in queue} | 
+ P{next arrival is a retry|z in queue} 
x [P{n + 1 outside arrivals during service and no colli- 
sion|service starts with z — 1 in queue} 
+ P{n outside arrivals during service and a collision|service 
starts with i — 1 in queue} ]. 
Use the memoryless property of the exponential distribution to obtain 





P{next arrival is from outside|z in queue} = —. 
A + la 
Set 8 = A/a for notational simplicity; then 

Prison = PEGDA) + del] + 2G [den - 1) +e - DI 9) 
To obtain the remaining entries of (pj) we need to account for 
boundary conditions. The details and the formulas are given in Ap- 
pendix A. 

It is easy to see that (pj) is irreducible and aperiodic. Therefore, 
there is a unique stationary distribution that is also the limiting 
distribution. Furthermore, since the continuous-time process {X(f); 
t = 0} starts from scratch (i.e., it regenerates) each time the Y-process 
reaches zero, the X-process is regenerative and the mean time between 
regeneration points is finite. This implies the limits in eq. (1) exist. A 
similar argument establishes that the limits in eq. (2) exist. 

We can use eq. (9) to explain why the finite capacity, K, is essential 
for our analysis. Suppose K = ©; then eq. (9) is valid for 1 = 2 and 
every n = 0. Equation (9) is also valid for n = —1 if we set d,(i) = 0 
whenever j < 0. Define 
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eo 


gi= » NPii+n — Pii-1; 


it is the expected size of a jump out of state z in the Y-process. 
Intuitively, if lim;... g: > 0, the Y-process is tending to infinity and 
limmoP{Ym = J} = 0 for all 7. From eq. (9) we can calculate (details 


are omitted) 
pe en Ba = ee) 7 if[e" Ye” = e*) 4 ewe] 
: i+B i+ B ‘ 


and so, if h > 0, 
lim gi = dy». (10) 


This phenomenon can be described physically. When / is large, e 7" 
is small so the probability that an ejection is a completion is very 
small. This means that once state j is reached (as it must be because 
the process is irreducible), the number of customers present grows 
monotonically with very high probability. 

The corollary in Kaplan® states that an irreducible and aperiodic 
Markov chain for which eq. (10) holds is not ergodic if p,, = 0 whenever 
j<i-—k for some k that is independent of i. At most, one departure 
can occur at a time, so pj; = 0 whenever 7 < i — 1. Therefore, the Y- 
process is not ergodic when K = ©. 


3.3 Solving the balance equations 


Instead of solving 7 = 7P to obtain {7;}, we will solve balance 
equations between two sets of states. Let . 


K 
p(Z=J) & P{Ymt ZJI| Ym = 1} = Y Diz. 
kaj 
In the steady state, the rate of transitions between states {0, 1, --+ , 7} 
and states {i + 1,7 + 2, --- , K} must be the same in both directions, 


so* 


TitiPi+i1i = DY TrDR=L + 1), l= 0, I, ee ho; (11) 
k=0 


We can solve eq. (11) by replacing 7; by x; and setting xo = 1, and using 
eq. (11) to obtain x;+1 from xo, +--+ , x; By setting 


K 
Ti = Xi > Xj 
i=0 
we obtain a nonnegative solution of eq. (11) that sums to one. 


* The basic idea was developed by Robert Morris and Eric Wolman. The details for 
Markov chains are given in Theorem 7-13 of Heyman and Sobel.’ 
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3.4 Departure point probabilities 


Recall that C,, is the mth ejection epoch. Let us write C,, = D when 
Cn is a departure epoch. Now 


P{Cn41 = D, Ym+1 =j} 
K 
i=0 


for every 7 and m. We will see below that P{Cnii1 = D, Ynii = 
J| Ym = i} does not depend on m, so denote it by qj. Then 


K 
pj = lim P{Chai = D, Ynsr = J} = Y Qi 
j=0,1,---,K—-1. (12) 


Observe that, forl=7= K-—2and1<isj—-1, P{Cna =D, Ys 

=J|Yn = 1} 

= P {next arrival is from outside|z in queue} 

x P {j — i outside arrivals during service and no collision|service 
starts with 7 in queue} 

+ P {next arrival is a retry|i in queue} 

x P {j—1+ 1 outside arrivals during service and no collision|service 
starts with z — 1 in queue} 


= Apdo + Fe dj-in(i — 1). 


The remaining entries are obtained by accounting for boundary be- 
havior. The results are: 








Joo = CoNo = Co, (13a) 
C 
qio = 1 a (13b) 
and 
gqo=0 for 122. (13c) 
ForlsjsK-— 2, 
qoj = d,(0), (14a) 
Boos 4 toy ., er 
qi = Bt 7 Gj) of Bri G-in(i- 1), 1sSisj, (14b) 


J = 
. = 7, 
Q+1,j B+jrl o(J), (14c) 
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and 
qi=0, tEsjr+2. (14d) 


When j = K — 1 we obtain 


qo,x-1 = > d,(0), (15a) 


gina =o > dali) + =z 3 dé», 
B+1 xX ie 






(15b) 
QK,K-1 = Ox-1. 


In eq. (46) we show that the infinite sums in eq. (15) have represen- 

tations as finite sums containing no more than K terms. Since Cr41 = 

D and Yn+i1 = K cannot occur simultaneously, gix = 0 for every 1. 
Notice that (the subscript « denotes a limit) 


ne & x pt = P{C. =D, Yo=K-V 


= P{an ejection epoch is a departure epoch} 
= P{no collision}, (16) 


where the last two probabilities are steady-state quantities and can be 
interpreted as long-run proportions. 


3.5 Converting {pi} to { pi} 


Let E(t) be the number of ejections by time ¢. Since {E(¢); t = 0} 
regenerates when an ejection leaves the system empty (i.e., when 
Ym = 0 for some m), 


¢ 4 lim E(t)/t 


exists. It is the ejection rate. It is also the rate at which packets gain 
access to an idle bus, so ¢ is the arrival rate of new and rescheduled 
packets, which is denoted by G in Kleinrock and Tobagi.' 

The rate at which the number of packets present jumps from 7 to 
i + 1 is Ap;. The rate at which it jumps from 7 + 1 to z is {pi. In the 
steady state these rates must be equal so we have 


Lemma 1: Ap; = §pi, t=0,1,---,K-—1. (17) 


Lemma 1 can be proved rigorously. The proof uses standard methods 
and is omitted. 
There are K + 1 unknowns in eq. (17) and K equations. We will find 
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Po by an independent argument; the remaining p,;’s and ¢ are obtained 
from eq. (17). 

Let To be the amount of time X(-) is zero in an arbitrary cycle of the 
X-process, and let M be the expected length of a regeneration cycle. A 
basic property of regenerative processes (see, e.g., Theorem 6-7 in 
Heyman and Sobel’) is that 


Po = E(To)/M. (18) 
Since X(t) = 0 if, and only if, ¢ is in an idle period, 
E(T) = 1/X. (19) 


When Y = 1, let ); be the average time between en ejection epoch and 
the next ejection epoch. We have 





vp+- i<kKk 
lata 
Yi = ‘ 
+ — if 1=K 20 


because vy; is v plus the expected time to the next arrival after the 
ejection epoch. Let m; be the mean number of visits of the Y-process 
to state i during an arbitrary regeneration cycle of the X-process. Thus, 
m; is the mean number of visits of the Y-process to state 1 between 
visits of the Y-process to state zero, because the X-process regenerates 
whenever the Y-process enters state zero. It is easy to show (see 
Exercise 7-78 in Heyman and Sobel’) that 


m; = 7/70, t=0,1,---,K. (21) 
Using eqs. (20) and (21) we obtain 


K 1 K-1 1 1 
= mM; = — i : + a 
M 2 yin a » TT, (+45) TK (>+z) 


_1i Kl Ba; Bax 
-25 (r+ d ak a 








where p = Ap. 
Substituting eqs. (19) and (22) into (18) yields 





K-1 
Bri Barr 
= + — |. 23 
Po nf (o+ 3 ers K (23) 
Obviously [and formally from eqs. (12) and (13)], we find that 
Po = 70. (24) 
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Combining eq. (17) with i = 0 and eqs. (23) and (24) yields 
K-1 
= i= Bai Brx 
p= rooipo=r/ (p+ Pe 4 Be ; (25) 


Equation (25) shows that { can be computed when the balance equa- 
tions are solved. 





3.6 The throughput, occupancy, and average waiting time 


The throughput, @, is the asymptotic departure rate. Since the 
departure process regenerates whenever the ejection process regener- 
ates, @ is well defined. The asymptotic departure rate equals the 
asymptotic rate at which packets are accepted, because all accepted 
packets eventually depart and the number of packets present is no 
larger than K; therefore 

K-1 


G=2X > Di. (26) 
0 


Combining eq. (26) with eqs. (16) and (17) yields 
6 = ine. (27) 


This equation states that the departure rate equals the ejection rate 
multiplied by the asymptotic proportion of ejections that do not suffer 
a collision. Equation (27) illuminates the essential trade-offs involved 
in using CSMA. One expects that ¢ increases and n, decreases as \ and 
a increase. This means that for each A, there is a value of a that 
maximizes 9. Let 6*(A) be the largest value of # that can be achieved 
when A is specified. We also expect that @*(A) will first increase with 
A and then start decreasing. 

Let ¢ be the long-run proportion of time that the bus is occupied. 
The regenerative arguments used above can be used to prove that ¢ is 
well defined and that 


K 
6=vr>¥m/M. (28) 
0 


Using eqs. (22) through (25) in conjunction with eq. (28) yields 
= vs. (29) 


This equation can be obtained from Little’s theorem by regarding 
the bus as “the system.” Then § is the arrival rate (since only packets 
that depart could have arrived) and rp is the expected time a packet is 
on the bus. Little’s theorem asserts that {v is the average number of 
packets on the bus, which is the proportion of time that the bus is 


samy 


@ = 00/ne. (30) 
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Suppose that we attempt to increase @ by increasing the arrival rate, 
and K and a are adjusted to keep n, constant (i.e., a fixed proportion 
of collisions). Equation (27) shows that increases in @ must be accom- 
panied by decreases in the occupancy of the bus, and that doubling 
the throughput would halve the occupancy. 

From eqs. (17) and (25), p; for 1 = 1 = K — 1 is obtained in the 
obvious way and px is obtained from px = 1 — ¥.f7" p;. Then 


K 
L=) ipi 
1 


is the average number of packets present in the steady state. Since @ 
is the arrival rate of packets that enter the system, Little’s theorem 
yields 


W=L/6, (31) 
where W is the average length of time that a packet is in the system. 


3.7 Relation to the model of Kleinrock and Tobagi 


In this section we explain how the “basic equation for the through- 
put” [eq. (3) in Ref. 1] can be obtained from the model in this paper. 
This will clarify the similarities and differences between the two 
models. 

In Ref. 1 Assumption 1 states that the average retry interval is large 
compared with the packet transmission time. In our notation, this 
assumption is that 1/a is large compared with v. Since there is no 
parameter corresponding to a in eq. (3) of Ref. 1, it appears that they 
have set a = 0. Let us do that. This means that every transmission 
that is destroyed by a collision stays in the queue forever. Conse- 
quently, Assumption 2 in Ref. 1, which states that the interarrival 
times of the point process consisting of packet arrival epochs and retry 
epochs is a Poisson process, is valid because there are no retry epochs 
and the arrival epochs form a Poisson process. 

Another consequence of a = 0 is that without a finite buffer, 
lim,..P{X(t) = 1} = 0 for all 1 because each packet transmission has 
a positive probability of being destroyed by a collision. Therefore, we 
make these two assumptions. 

Assumption 3. There is a finite buffer that can hold K —-120 
packets. 

Assumption 4. If a packet transmission is destroyed when the buffer 
is full, that packet is flushed from the system. 

The next assumption is required to replicate the collision process in 
Ref. 1. 

Assumption 5. Packets that arrive when the buffer is full and a 
packet is being transmitted will destroy that transmission. 
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With these assumptions and a = 0, the arguments and formulas in 
Sections 3.2 through 3.5 can be used to obtain the results given below, 
but we will bypass that roundabout route and give a direct argument 
' (which is essentially the argument in Ref. 1). 

In the steady state, whenever a packet seizes the bus, there are 
K — 1 packets in the buffer. The mean time between entries to state 
K — 1 is, via eq. (20), » + 1/A, and 1/A is the mean length of stay in 
state K — 1, so 


yr... A 
y+1/A 1+dAr 





(32) 


PK-1 >= 


The probability that a transmission is not destroyed by a collision is 


e ', So 
g= Apr-ie™”. (33) 


Equation (7) of Ref. 1 and a = 0 show that 


yp=1+2h—(1—e™)/X. (34) 
Substituting eqs. (32) and (34) into eq. (33) yields 
he? 


- AQ + 2h) +e’ 


which is eq. (3) in Ref. 1. 

We can conclude that eq. (3) in Ref. 1 is valid when a = 0 and the 
derivation in Ref. 1 requires assumptions 3, 4, and 5. Our model does 
not use assumptions 4 and 5 so it should produce a greater throughput 
when all other factors are the same; this is demonstrated numerically 
in Section IV. Since K = 1 is allowed, this derivation explains why eq. 
(3) in Ref. 1 becomes the single-server Erlang loss formula when there 
are no collisions, i.e., why h | 0 yields eq. (9) in Ref. 1. 


IV. NUMERICAL RESULTS 


A Fortran program to solve the equations in Section III was imple- 
mented in double-precision arithmetic on a PDP-1170. Because of 
memory limitations and the design of the program, K =< 30 was 
required. The running time on a CRT display terminal with a 9600-baud 
line is less than a twinkling of an eye. 

In all the numerical examples we use h = 0.01. This is the value 
used by other authors. It is a reasonable value to use for a system such 
as Ethernet with loops of about 300 meters carrying 128-byte packets 
at 10 megabits/sec. In general, ; 


hv=—, 
sc 
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where 
l = loop length in meters, 
r = transmission rate in bits/second, 
s = packet size in bits, and 


c = speed of light in m/second. 


4.1 The upper and lower bounds are close 


In Table I we present various cases that demonstrate that the lower 
and upper bounds for 6 and W are close. The subscript “avg” stands 
for average and the entries are obtained by setting vy = 1 + h. In all 
these cases, K = 20. 

Table I suggests the approximations 


Wave = (Wa + Wis) and Dave = Y2(O15 + Ius). 


We make no claim for the general validity of these approximations. 
Based on the encouraging results in Table I, and on several dozen 
other examples not reported here, we will henceforth use the case pv = 
1 + A as an approximation; there will be no explicit mention that our 
numerical results are approximate. 


4.2 The effects of changing a 


The first two pairs of columns in Table I show that for fixed 4, 
different values of a yield different values of the performance measures. 
In this section we explore the consequences of changing a. 

When K and X are given, the maximum achievable throughput is 
attained when there are no lost service times from collisions and 
instantaneous retries, i.e., by an M/D/1/K queue. Let the throughput 
of the M/D/1/K queue be denoted by @max. When K = 20 and A = 0.7, 
O@max = 0.700 to three decimal places. In Table II we show that good 
choices of « will achieve @ = 0.699, but the throughput for poor choices 
of a have a much lower value. In particular, very small values of a do 


Table I—Evidence that the bounds for @ and W are close 


A 0.7 0.7 0.5 0.9 1.0 3.0 

a 0.01 3.0 0.5 1.0 1.0 2.0 
us 0.459 0.673 0.818 0.803 0.798 0.560 
Bavg 0.457 0.667 0.812 0.796 0.790 0.555 
1 0.455 0.660 0.806 0.788 0.782 0.549 
Wie 42.1 11.4 19.6 19.9 22.7 30.8 
Warg 41.9 10.2 19.1 19.3 22.3 35.5 
Wi 41.7 9.1 18.7 18.7 21.9 35.1 
avg Nec 0.991 0.828 0.923 0.861 0.824 0.570 
avg @ 0.466 0.814 0.888 0.933 0.947 0.984 
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not do well. This confirms the observation given in Section 3.7 that 
the model in Ref. 1, in which a = 0, underestimates the throughput. 

The explanation for the qualitative properties shown in Table II is 
as follows. When a is very small, a packet that upon arrival finds 
another packet in service will spend a long time waiting to retry. This 
makes collisions rare but tends to keep the buffer positions full, which 
makes the throughput and bus utilization low. As a is increased, the 
retries are more frequent, and although there are more collisions, this 
is more than offset by the shorter times spent waiting to retry. When 
a is made sufficiently large, the retries occur so rapidly that too many 
packets are destroyed and performance degrades. 

A significant feature of the data in Table II is that @ does not vary 
much for 0.5 = a = 1.6. The variation within the interval 0.5 = a < 1.4 
is in the fourth decimal place. The value of « that achieves the lowest 
value of W also achieves a high value of 6, but the value of a that 
achieves the highest value of # has a value of W about 9 percent larger 
than the best value of W we have found. 

Let 6* denote the largest value of @ that is found for fixed values of 
A, A, and K. Numerical results not reported here have shown that 6*/ 
Omax increases as A decreases. This is not surprising because lower input 
rates result in fewer collisions. Since 6* = 0.6993 and @max = 0.700, 0*/ 
Amax is very close to one. This suggests that CcsMA will provide good 
throughput performance when A < 0.7, and that collision detection 
schemes will not increase throughput very much when JA S 0.7. 


4.3 The effects of changing X 


Take K = 20 and h = 0.01. Choose a so that the throughput is 
maximized. The effects of changing A are shown in Table III. 

Notice that 8 increases with A for A = 2 and then @ decreases very 
slightly when A = 3. This indicates that the phenomenon of lower 
throughput with higher arrival rate will not occur in the normal 
operating range of A < 1. Table III also shows that a should not 
increase as A increases. This property also appeared in every example 
we tried. 


Table II—Performance measures vs. a when K = 20 andA = 0.7 


a 0.001 0.01 0.1 0.5 0.8 1.0 

0 0.362 0.457 0.660 0.6989 0.6993* 0.6992 
Ww 53.9 41.9 22.8 8.34 6.51 6.51 
Nn, 0.993 0.991 0.979 0.968 0.963 0.963 
o 0.368 0.468 0.681 0.729 0.734 0.734 
a 1.4 1.6 2.0 3.0 4.0 5.0 

0 0.6986 0.6980 0.696 0.667 0.556 0.423 
Ww 5.53 5.52 5.87 10.2 24.1 42.1 
Nn, 0.949 0.943 0.927 0.828 0.612 0.437 
@ 0.743 0.747 0.758 0.814 0.917 0.977 
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The average waiting times shown in Table III are about the same as 
the corresponding quantities for 1000 sources shown in Fig. 3 of 
Ref. 2. 

Suppose we design a system for a nominal arrival rate and from time 
to time the actual arrival rate differs from the nominal arrival rate. In 
Table II we see that performance suffers if a poor value of a is chosen. 
Now we show that moderate changes in A will not cause much 
degradation in the throughput rate or the average waiting time. 

We assume that K = 10 and h = 0.01 are held fixed. The nominal 
arrival rate is 0.7 and a = 1.6 yields the largest throughput rate. We 
also assume that when A changes, the new value is maintained long 
enough to ensure that steady-state performance measures are ade- 
quate. If we achieve good performance with arrival rates A; and Az, it 
is reasonable to suppose that we will achieve reasonable performance 
when A is changing from A, to Az. 

We will use the notations 9* and W* for the best values of @ and W 
we have found by varying a. The entries marked @ and W are for 
a= 1.6. 

Table IV shows that, for the values chosen, deviations from the 
nominal load will not cause serious performance degradation if a is 
kept fixed. 


4.4 The effects of changing K 


As K is increased, more packets can be stored, so the throughput 
should increase. This cannot be carried too far because as K > o, 
limp. Yn = © (as shown in Section 3.2), which causes the probability 
of a collision to approach 1 and @ — 0. We have not found a value of 
K to demonstrate this numerically. 


Table III—Performance measures vs. A when K = 20 and h = 0.01 

r 0.7 0.9 1.0 2.0 3.0 

a 0.8 0.6 0.5 0.5 0.4 

6 0.699 0.813 0.817 0.818 0.817 

Crise 0.700 0.898 0.975 1.00 1.00 

W 6.51 18.9 21.4 23.9 24.1 

Ne 0.963 0.911 0.914 0.913 0.905 

o 0.734 0.901 0.904 0.905 0.912 


Table IV—é@ and W vs. A when a = 1.6, K = 10, and h = 0.01 


r 0.5 0.6 0.7 0.8 0.9 1.0 
ty] 0.500 0.599 0.692 0.764 0.801 0.812 
o* 0.500 0.599 0.692 0.764 0.801 0.815 
Ww 2.30 3.07 4.37 6.22 8.06 9.39 
w* 2.18 2.65 4.18 6.09 8.05 9.38 
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In Table V below we use the following notations: 
0* = largest value of @ found 
ag = value of a that achieves 6* 
W* = smallest value of W found 
ai = value of a that achieves W* 
Wag) = Wwhena= ag 
(atv) = 6 when a = a 
O@max = throughput of M/D/1/K queue. 


In Table V, 6* and W* increase with K; a jv and a@ decrease with K; 
aw = ag, and the difference is small and gets smaller as K increases; 
W(ag) is not much bigger than W*; and @(ajv) is not much smaller 
than 6*. 


4.5 Conclusions 


From the data presented in this section, and from dozens of unre- 
ported sets of calculations, we conclude that: 
(t) The bounds on @ and W are close, and choosing v = 1 + A yields 
a good approximation. 
(tt) It is important to choose a good value of a, and performance is 
not sensitive to changes in a near the best value. 
(tii) The value of a that minimizes W is close to the value of a that 
maximizes @. 
(iv) The best value of a decreases as A and K increase. 
(v) For A = 0.7, 6* is essentially the same as Omax when K = 5. 
(vi) Although a should vary with A, 6 and W do not significantly 
differ from their best values if a is held fixed while 4 varies over a 
moderately wide interval. 


V. BATCH ARRIVALS 


In this section we replace the assumption that packets arrive ac- 
cording to a Poisson process with the assumption that packets arrive 


Table V—Performance measures vs. K when A = 0.9 and h = 0.01 


K 5 10 15 20 30 
9mnax 0.842 0.885 0.895 0.898 0.900 
6 0.771 0.801 0.810 0.813 0.814 
O(aw) 0.771 0.798 0.808 0.811 0.814 
ag 3.0 14 0.8 0.6 0.4 
aw 4.0 1.8 1.0 0.7 0.4 
w* 3.61 8.05 13.2 18.8 30.5 
W (a3) 3.66 8.12 13.4 “18.9 _ 80.5 
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according to a compound Poisson process. In Fuchs and Jackson,” 
statistical analysis of call arrival times are given. Two of their conclu- 
sions are as follows: The exponential distribution is a reasonably good 
approximation of the time between bursts, and the size of a burst 
(measured in various ways) has a geometric distribution. The purpose 
of this investigation is to find out how sensitive the performance 
measures are to the assumption of Poisson arrivals. We will see that in 
this model, the throughput can be significantly lower with bursty 
arrivals than with Poisson arrivals with same rate. 

Specifically, we assume that bursts arrive according to a Poisson 
process with rate A», and the bursts Bi, Be, --+ are iid with 


P{B, =i} =(1-€)é5, i= 1,2,---. (35) 


It would be nice to interpret this process as one where messages arrive 
according to a Poisson process with rate A, and the jth message 
consists of a random number of packets with a geometric distribution. 
Unfortunately, we cannot do that in the context of the model described 
in Section II. This arrival process does not conform to the finite buffer 
interpretation of the model because only the current lead packet in 
each message would compete for the bus; we, on the other hand, 
assume that all packets in the buffer compete for the bus. This arrival 
process does not conform to the interpretation that there are K ports 
because that would require that if the first packet (i.e., the message) 
finds a free port, then all the packets in the message would enter the 
system. We assume that only those packets that find a port gain access 
to the system. 


5.1 Details of the arrival process 


Let A(t) be the number of arrivals during (0, T'). Since A(¢) is a 
compound Poisson process, we know that there are constants M and 
V, such that 


E[A(t)] = Mt 
and 
Var[A(t)] = Vt 
with V= M. We call M the arrival rate. From eq. (35) we obtain 


and Var(B,) = ee 


1 
MOU Te (1 — €)?" 





Standard calculations yield 
Aot 


E[A(t)] = AstE (Bi) = — z 
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and 
Var[A(t)] = Ast Var(Bi) + AstlE(By) 7? 














_ Aot(E + 1) 
(1—&)* © 
In an obvious way we obtain 
2M” V-M 
Say OO RE eV 
Letting z = V/M = 1 yields 
2M z-1 
= d €= : 
Me 1+2Z ands z+1 38) 


Equation (36) relates the parameters we might obtain from measure- 
ments, M and z, to the parameters of the model, A, and €. Let 


b(n, 7) = P{B, + Bo + --» B=n}; 


it is the probability that n packets are contained in j bursts. The sum 
of iid geometric random variables has a negative binomial distribution, 


0 if n<J 
ae =) (1-é)//é"% if n=j 


for all nonnegative integers 7 and n. We can compute b(n, j) by 
recursion from 


(0, 0) = J; (1, 1) =l]- g, 
b(7,,7)=(1-§)b7-Ly-1), s22, 


and 
eo sl 
Wj +i) = 2 gj +h 1,9), kh=1 and j=0. 


5.2 Changes in the Poisson model 


In this section we describe the changes in the equations in Section 
III that are caused by compound Poisson arrivals. 
Let cs(7) be the probability that 7 batches arrive in an interval of 
length pv; therefore, 
—A,p (Asp)? 
ae 


co(j) =e (37) 


Let 


d,(i,j) = P{no collision and n arrivals in a service interval|/ 
batches arrive and start with z in queue}; 
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then 


d,(t) = dn(i, J)ev(J) . (38) 


Me, 
1M = 


Expand on the argument used to obtain eq. (6) to conclude that 


d,(i, j) = 6:njb(n, j). (39) 
Similarly, 
d,(t) = » (1 — 6:n;) b(n, J)es(7). (40) 
c 


The transition probabilities described in Section 3.2 and Appendix 
A are given in terms of the d and d functions, so they do not change 
form. The infinite sums, used in Section A.1 of Appendix A, have to be 
recomputed; this is done in Section A.2 of Appendix A. 

The steady-state balance equations depend only on the transition 
probabilities, so everything in Section 3.3 is still valid. No changes are 
required for the equations in Section 3.4. 

Since the arrivals do not form a Poisson process, time-average 
probabilities need not equal arrival epoch probabilities and we cannot 
use the arguments in Section 3.5 to obtain the steady-state probability 
that i packets are present. This means we have not obtained L and W 
for this model. We can obtain the throughput by using the arguments 
in Section 3.5. 

Let ¢ be the ejection rate and po = lim,_,. P{X(t) = 0}. In the steady 
state, A,po is the rate at which transitions leave state 0 and {7 is the 
rate at which transitions enter state 0. Since these rates are equal, we 
have 


£ = Awpo/ 70. (41) 


Equations (18) through (25) are valid for compound Poisson arrivals 
when A is replaced by A». The throughput is obtained from eq. (27). 


5.3 Numerical examples 


Let 6, be the throughput when z is the variance to mean ratio. In 
Tables VI and VII we use the value of a that maximizes 6, and h = 
0.01. 

In Tables VI and VII we see that batch arrivals can significantly 
degrade throughput, and the degradation increases with z and de- 
creases (except, possibly, for very large A) with X. 

In Section IV we saw that @ is strongly influenced by a. That 
observation suggests that the throughput values in Tables I and II 
might improve if a varied with z. Our numerical experience suggests 
that, in the vicinity of the best a, @, is not sensitive to changes in a. 
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Table VI—Throughput vs. M for K = 5 and h = 0.01 


M 0.1 0.5 0.7 0.9 1.0 2.0 5.0 
a 20.0 4.0 4.0 3.0 3.0 2.0 2.0 
0; 0.100 0.497 0.667 0.771 0.798 0.828 0.819 
2 0.069 0.382 0.535 0.663 0.709 0.819 0.813 
4s 0.035 0.205 0.304 0.403 0.451 0.741 0.804 
42/6 0.689 0.769 0.868 0.860 0.888 0.989 0.993 
4; /6; 0.349 0.412 0.456 0.523 0.565 0.895 0.982 


Table Vil—Throughput vs. M for K = 10 and h = 0.01 


M 0.1 0.5 0.7 0.9 1.0 2.0 5.0 
a 10.0 1.5 1.6 1.4 1.2 0.9 0.6 
a 0.100 0.500 0.692 0.801 0.815 0.821 0.816 
a, 0.069 0.400 0.590 0.737 0.775 0.816 0.805 
05 0.035 0.235 0.371 0.512 0.576 0.798 0.788 
82/8; 0.690 0.800 0.853 0.920 0.951 0.994 0.987 
85/8; 0.350 0.470 0.536 0.639 0.707 0.972 0.966 


The largest improvement that we could achieve by changing a was 
0.003. 


VI. COLLISION DETECTION 


Suppose that a time units after the vulnerable period ends, the bus 
is examined for a collision. When a collision is detected, the packet 
transmission is aborted and that packet is returned to the buffer. 
Collision detection reduces the time spent transmitting garbage, so it 
will increase throughput and reduce delays. 


6.1 Changes in the model without collision detection 


Two changes in the equations in Section III are required to describe 
collision detection. The first change is in eq. (7). Let 


nn = P{no outside arrivals prior to h|n outside arrivals 
prior to a + h}; 


then 





Hi n 
n= ’ = VU, 4, e°°. 42 
(4) wee 2) 


Let c% be the probability than n outside arrivals occur in an interval of 
length a + h, ie., 


s (a + h)” 
ef = @ Math) —, n=0,1,-+5; 
n!} 
then 
di(i) 4 P{collision and n outside arrivals by a + | service 
starts with i in queue} 
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=(1-nndi}en, in20, (43) 


where 6; is given by eq. (5). 

The equations in Sections 3.2, 3.3, and 3.4 are given in terms of the 
functions d and d, so no changes are required. In Section 3.5 we need 
to change eq. (20) because the expected length of time that a packet 
is on the bus is not pv. Let 


p-(t) = P{a collision occurs| service starts with i in the queue}. 


There are no collisions if, and only if, no arrivals (inside or outside) 
occur within a time interval of length h. Thus, 


1 — p(t) = d:c6 
or 
pct) = 1 — 6:c6, i=0,1,---,K-2. (44a) 
When i: = K — 1, no outside arrivals are permitted, so 
pAK — 1) =1— 8x-1. (44b) 


Let 7; be the transmission time of a packet, given that i packets 
were present at the last ejection epoch, and let v; = E(T;); then 


> 


pr ajit A if no collision 
‘  |a+ hifa collision 


and so 





=> 


{(1 + A)[1 — pe(i)] + (a + h)p-(i)} 


es — + (1 + ALL — peli — 1)] + (@ + A)peli — 1)} 


la+2X 





Bp-(t) + ip(i-1) , 
=1+h-(1—- a) ———_——__—"_ j= aii SIC ce 
(1 — a) i+B ,t=0,1,---,K—-—1. (45) 
We replace the » in eq. (20) by v;, which induces some obvious changes 
in eqs. (22) through (25). 


6.2 Numerical examples 


In Tables II and III we saw that when A = 0.7, the throughput in the 
vicinity of the best a (0.8) is very close to 0.7; but when a <= 0.1 or 
a = 3.0, the throughput can be much smaller than 0.7. With collision 
detection, with a = 0.02, throughputs of about 0.7 can be achieved with 
a as large as 5.0. Collision detection reduces the average waiting time 
by about one-third. This example suggests that when a throughput 
close to the maximum possible can be achieved without collision 
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Table Vill—Performance measures vs. A when 
K = 20, h = 0.01, anda = 0.02 


A 0.9 1.0 2.0 3.0 
a 4.5 3.0 2.5 2.5 

0 0.891 0.935 0.943 0.942 
WwW 8.53 14.9 20.6 20.9 
Ne 0.718 0.673 ‘ 0.628 0.619 
cy 0.910 0.958 0.969 0.970 


Table IX—Performance measures without 
collision detection 


A 0.9 1.0 2.0 3.0 
a 4.5 3.0 2.5 2.5 
0 0.434 0.569 0.606 0.602 
Ww 44.2 33.3 32.4 32.9 
Ng 0.444 0.585 0.625 0.620 
¢ 0.989 0.983 0.980 0.981 


detection, collision detection will lower the average waiting time and 
make the throughput less sensitive to a. 

Now we investigate the effects of collision detection when the 
throughput is significantly smaller than the arrival rate. 

By comparing Tables III and VIII we can see the effects of collision 
detection. Throughput increases significantly for each A and W de- 
creases. An indirect effect of collision detection is that larger values of 
a are best. This reduces the time spent waiting for a retry, makes the 
probability of no collision smaller, and yields a larger occupancy for 
the bus. If the values of a shown in Table VIII were used without 
collision detection, performance would degrade significantly, as shown 
in Table [X. From the last two rows of Table [X we deduce that the 
bus wastes a lot of time transmitting packets that have been destroyed. 
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APPENDIX A 
Transition Probabilities 


In Appendix A we record the transition probabilities omitted from 
the text. 


A.1 Transition probabilities for Section 3.2 
Let 


Sa(j, i) = Y dali) 


n=j 
and 
Salj, i) =D dali). 


n=j 
The former is the probability that more than j — 1 packets arrive in a 
service interval and there are no collisions when i packets are in the 
buffer at the start of the service interval. The latter is the correspond- 
ing probability when there is a collision. For computational purposes 
it is important to represent these infinite sums as finite sums. Observe 
that 


a 
Sa(j, t) = Sz(0,t) — Y dnt), = J 21, 


n=0 
and 
j-l 
n=0 
then 
Sa(0, i) = Y di) = 6: (—*) ew BT =e. (46) 
n=0 0 v nN: 


Since S7(0, 7) + Sa(0, 1) = 1 is easily established, 


S.(0,1) =1—e™; (47) 


this equation also can be obtained by calculating the sum explicitly. 
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To modify eq. (9) when i = 0 we have to delete terms with negative 
arguments. When 7 + n = K — 1 we need to recognize that state K — 
1 is reached if n = K — 1 — i packets arrive and there is no collision. 
When i + n = K, we need to recognize that state K is reached if, and 
only if, at least K — 1 — i packets arrive and there is a collision. 

The following transition probabilities are obtained: 


Poo = do(0) 
po =G(0)+gG(0) 1sjsK-2 


¥ d,(0) + dx-o(0) = S7(K — 1, 0) + dx-2(0) 
K-1 




















Po,K-1 = 

pepe > d,(0) = Su(K — 1, 0) 

Pro = 7 FH 

ee dy +—2 di(0) + do(0) 
oe eee i+ 9 lt (0(O) | 
pees AF gld) + deat] +5 ld (0) +dj-a(0)], 
2<j=K-2 

His ea eke 23 a 

1,K-1 1 + B ’ 


a ea [Sz(K — 1, 0) + dx-2(0)] 





__& = 
PK = T+p ou 2, 1) + 1 


For 2<i1<= K — 2, 


1 
eo 1, 0). 


do(i — 1) 





Pii-1 =F 


i) 


~, 


T + 


dp(i) + a [di = 1) + do(t = 1)] 





Pi = 


Kew) 


l 


oles ab 





Dij = Id) + dn +25 a 


+ dj-i(i — 1)], it1lsjsK-—2, 


[d;- i41(l — 1) 
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re vat Cee ener ae Peery 








+B 
aa —— [SaK — i, i — 1) + dx-1-:(i — 1)] 
bg es Se Shoot ie oa ea 
ee ay d L 2 i+B d l,l 
K- 
PK-1,K-2 =p - 2) 
_ 8 0K — K-1l ey Ke 
PK-1,K-1 ay SY a 1) tH i+B [Sz(1, K — 2) 
+ do(K — 2)] 
_ _B 24g RT = 
DK-1,K “RIT po K-1)+ Role Boal K — 2) 


DPxexK-1 = Ox-1 
PKK =1-— dx-1. 


A.2 The effect of batch arrivals 


Equations (46) and (47) have to be modified for the compound 
arrival process. From eqs. (38) and (39) we obtain 


Sq(0, t) = > d,(i) = > » dn (i, J )es(J) 


n=0 J=0 


8 


= Ye) x In (i, J) 


= : co(J) Y dnd, ) 


=p \i 
= 6 > co(J) Y (—* ) b(n, J). (48) 


n= 


To evaluate the double sum, let 
= )) O(n, J)eo(7); 
j=0 


it is the probability that n customers arrive during a service interval. 
Let 


z)= x Z"An = ys ze » b(n, j)co(7). (49) 
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Comparing eqs. (48) and (49) we see that the double sum in eq. (48) is 
A(-) evaluated at z = (vy — h)/». 
Standard generating-function arguments yield 


A(z) = C[B(2)], (50) 
where 
B(z) = y z*P{B, = k} 2? 
k=0 — é 
and 


fee] 


C(z) = Yt z/es(j) = err), 


j=0 


Substitution into eq. (50) yields 
x —1 
A(z) = exp (rw 2 I (51) 
=e 





Evaluating eq. (51) at z = (v — h)/v and substituting the result into eq. 
(48) yields 


Sz(0, 1) = dexp SAL 


As before, 
Sa(0, t) =1- Sz(0, L) 


is obtained easily. 


APPENDIX B 
List of Symbols 


The following is a list of symbols and their definitions as used in this 
paper. 
= retry rate 
= one-way propagation delay 
= system capacity in packets 
Poisson arrival rate 
= compound Poisson arrival rate 
= P{no collision} 
= service time (constant) 
= proportion of time bus is occupied 
throughput 
= throughput of an M/D/1/K queue 


average wait (in queue plus in service) of a packet 


variance to mean ratio of compound Poisson process 


Serpe 
I 


3 
EF 


“N 
I! ll 


nN 
I 
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¢ = ejection rate of packets from the bus. 

An asterisk on a symbol means the best value found. The subscripts 
lb, ub, and avg stand for lower bound, upper bound, and average, 
respectively. 
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Implementing and Testing New Versions of a 
Good, 48-Bit, Pseudo-Random Number 
Generator 


By C. S. ROBERTS 
(Manuscript received October 27, 1981) 


In this paper we describe the design, testing, and use of drand48— 
a good, pseudo-random number generator based upon the linear 
congruential algorithm and 48-bit integer arithmetic. The drand48 
subroutine is callable from C-language programs and is available in 
the subroutine library of the UNIX * operating system. Versions coded 
in assembly language now exist for both the PDP-11 and VAX-11 
computers; a version coded in a “portable” dialect of C language has 
been produced by Rosler for the Western Electric 3B20 and other 
machines. Given the same initialization value, all these versions 
produce the identical sequence of pseudo-random numbers. Versions 
of drand48 in the assembly language of other computers or for other 
programming languages clearly could be implemented, and some 
output results have been tabulated to aid in testing and debugging 
such newly coded subroutines. Timing results for drand48 on the 
PDP-11/45, the PDP-11/70, the VAX-11/750, and the VAX-11/780 
are also presented and compared. 


I. INTRODUCTION 


The work described in this paper arose when one day the author 
found himself in need of a good, pseudo-random number generator 
that would execute and produce identical results on two different 
computers (in this case, the 16-bit PDP-11 and the 32-bit VAX-11, 
both manufactured by Digital Equipment Corporation). Good, pseudo- 
random number generators often require multiple-precision arithme- 
tic; hence, to achieve speed they are usually implemented in assembly 
language and are dependent upon the word length of the computer. If 


* UNIX is a trademark of Bell Laboratories. 
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this is not enough of a barrier to portability, add the fact that the 
recoding of a pseudo-random number generator for a different com- 
puter is an error-prone endeavor. The author has himself found bugs 
in several pseudo-random number subroutines that were supposedly 
“correctly” coded. (The author admits to being facetious, but do such 
bugs make the output from a generator more or less random?) 

This paper does not present a magical method to eliminate these 
difficulties. However, it does present enough data and intermediate 
results so that a person may code on any computer a good generator 
based upon 48-bit integer arithmetic and then begin to test the new 
version for bugs. In addition, we describe proper usage of routines 
drand48, lrand48, mrand48, erand48, nrand48, and jrand48—C-lan- 
guage callable functions to generate pseudo-random numbers—cur- 
rently available at Bell Laboratories in the subroutine library of the 
UNIX* operating system for the Western oe 3B20, the PDP-11, 
the VAX-11, and other computers. 

The pseudo-random number generator considered in this paper is 
based upon the well known linear congruential algorithm, e.g., see 
Section 3.2.1 of Knuth.’ The next number in the pseudo-random 
sequence is generated according to the formula 


Xn+i = (aXn + C)aodn n= 0. 


We choose the value of m to be 2“; hence, 48-bit integer arithmetic is 
required. The values of the multiplier a and the addend c are chosen 
as follows: 


a = 5DEECE66Di6 = 273673163155 
C= Bie = 13s. 


While other equally good choices for m are clearly possible, we chose 
2* for the following reasons, based upon matters of taste and judgment. 
The word length of many popular computers is a multiple of 16 bits; 
hence, it seemed wise and convenient to choose logam to be an integer 
multiple of 16. A period of 2*° for the generated pseudo-random 
sequence seemed long enough for most purposes, while 2°* seemed 
dangerously short. Assuming it takes 10~* second to generate a pseudo- 
random number (see the timing results in Section IV), a period of 2“ 
corresponds to 893 years, while 2°° would only require 119 hours for 
the complete cycle to be generated. The argument for 48 bits becomes 
even more compelling if we assume that a future processor or custom- 
ized hardware might be a factor of 10 or 100 faster. 

The value of a, chosen above, was one of the multiplier values 
judged by Coveyou and MacPherson’ t to be satisfactor ¥ according to 
the “spectral test,” one of the most demanding of the canonical tests 
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for overall pseudo-random number-generator quality. The above val- 
ues of a, c, and m ensure that the generated pseudo-random sequence 
will have the maximum possible period, i.e., 2*° (see Theorem A, 
Section 3.2.1.2 of Knuth’). The testing done by Coveyou and Mac- 
Pherson? was demanding enough and the properties of pseudo-random 
sequences based upon the linear congruential algorithm are well 
enough understood’ that further statistical testing of the generator 
was deemed unnecessary. 

The linear congruential algorithm with these choices for the param- 
eters yields a generator that exceeds the requirements of most users 
and is much better than many generators in common use today. 
Obviously, this is not the “optimum” pseudo-random number genera- 
tor; it will not meet all the requirements of any potential user, and the 
author makes no such claims. However, in more than four years of use 
in numerous programs at Bell Laboratories, it has served well, even in 
situations where other generators have failed. 


il. PSEUDO-RANDOM GENERATOR RESULTS 


The pseudo-random sequence generator described and specified in 
Section I was coded in assembly language three times—once on the 
PDP-11 using the 16-bit integer arithmetic instructions “mul” and 
“add,”* once on the PDP-11 using the 64-bit floating-point arithmetic 
instructions “mulf’ and “addf,”* and once on the VAX-11 using the 
32-bit integer extended multiply and add instruction “emul.’* After a 
bit of debugging, all three independent versions finally produced the 
identical sequence of 48-bit pseudo-random integers. For purposes of 
comparison with future newly coded subroutines, a portion of that 
sequence is presented below (in hexadecimal) as Table I. 

Since it is common practice to treat the output from a pseudo- 
random number generator as a pure binary fraction, thus yielding a 
uniform distribution over the interval 0 to 1, the numbers in Table I 
were so treated and then multiplied by 4096 to yield the decimal 


Table I—A portion of the pseudo-random sequence X; of 48-bit 


integers 
1234ABCD330E 657EB7255101 D72A0C966378 5A743C062A23 72534ABF62F2 
5195D97A8D15 E2BCF94AEFFC 03FD3CD49657 9586EFCA2D16 280C61DEF669 
623B341D40CO BOESA9A1L1CB OFI160B4F57A E65CDA1020FD 29DE25BD59C4 
28B8E8F5507F 8876EDD9601E 9AA93190EODI 952BC3577F08 451CD3C24673 
63F661075102 4BIC4CBD49E5 BE0C7218348C 4C6C2C9427A7 135676A8EC26 
67ACF11EB039 DB7DIEFO3E50 F124D606681B A9AF4526958A D8B2A2FFA7CD 
00B48E98A054 765E7C77BBCF 8858368AFI12E C9B2484004Al 43FF29D69E98 
FB95A6FE16C3 4E897866E312 99D1A468DAB5 9BD4C9FFBDIC 3662639AACF7 
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Table II—Pseudo-random integers Y; corresponding to the X; in 
‘Table | 


291 1623 3442 1447 1829 1305 3630 63 2392 652 
1571 2830 241 3685 669 651 2183 2474 2386 1105 
1599 §=1201 3040 = 1222 309 1658 3511 3858 2714 3467 

11 1893 2181 =93227) «1087 §=964025) «1256 )=—2461 = 2493 870 
3628 1247 622 1383 1587 2636 3086 2472 2177 = 1881 
2672 1340 3876 1507 3866 30 2115 «1117 99 2424 

839 3595 243 1068 1240 3651 2040 2908 1173 3542 

2767 =1877 =3930) )=—3173) Ss 1542 936 1452 1230 2743 2944 





integers in Table II. More precisely, each integer Y; in Table II was 
computed from the corresponding value of X; in Table I by the formula 


Yi = L(Xi) (2) (4096) ] = [2-°°X:]. 


For some purposes, Table II may be more convenient or appropriate 
than Table I. 

Tables I and II should be valuable to a person attempting to code 
and test a new version of the generator subroutine. After initializing 
with Xo = 1234ABCD330Ei¢, the new version, if correct, should repro- 
duce Table I (or Table II). The fact. that three independent codings of 
the 48-bit linear congruential algorithm gave the same results gives 
the author substantial confidence that Tables I and II are indeed 
accurate. In addition, Lawrence Rosler of Bell Laboratories has inde- 
pendently verified Tables I and II using both C- and assembly-language 
code he produced for the Honeywell H6000 series of 36-bit computers.” 

The author has found that an empirical but effective heuristic for 
quickly evaluating the general quality of any pseudo-random number 
generator is to display the output bits on a bilevel display device, and 
this has been done in Fig. 1 for the generator defined in Section I of 
this paper. (A bilevel display device consists of an n X m rectangular 
array of dots, each of which can be made either white or black.) Figure 
1 was generated using only the leftmost 32 bits of each 48-bit X;; 2° 
values of X; were generated, thus giving a total of 2’® bits. These bits 
were displayed on a 512 X 512 crT display device with a 1 being 
displayed as white and a 0 being displayed as black. Good quality 
pseudo-random number generators produce displays similar in ap- 
pearance to Fig. 1—random salt and pepper effect with no visible 
patterning. For comparison, Fig. 2 shows the display produced by an 
inferior generator.* Note the prominent nonrandom patterns visible in 


* The author apologizes for the generally poor quality of Figs. 1 and 2 and assures the 
reader that the result is much more striking and apparent when the original CRT or 
photographic plates are viewed. Glossy prints such as these simply do not reproduce 
well. 
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Fig. 1—Bilevel dot display generated by drand48. 


Fig. 2, which was produced by a 1977 version of the library routine 
“rand,” described in Section III of the UNIX Programmer’s Manual. 
(This 1977 version used the linear congruential algorithm with m = 
2). the version of “rand” currently in the UNIX library (1982) is better 
since it uses m = 271.) 


lll, SUBROUTINES FOR USE WITH THE C PROGRAMMING LANGUAGE 


As an example of how the calling sequences to the pseudo-random 
number generator might be designed, we now describe some routines 
that were coded for use with the C programming language.® So far the 
author has himself implemented versions of these routines only for the 
PDP-11 and VAX-11 computers. However, a version for other com- 
puters that support the C language has been implemented by Rosler’ 
and is now in use on the Western Electric 3B20, the Honeywell H6000, 
the IBM-370, and the Motorola MC68000 computers. 

The pseudo-random number generator was implemented in assem- 
bly language as one subroutine having nine entry points. Six of. the 
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Fig. 2—Bilevel dot display generated by an inferior pseudo-random number generator. 


entry points generate the next pseudo-random X; in the sequence and 
then convert the leftmost bits of it into the particular type of data 
item desired—floating-point fraction, positive integer, or signed inte- 
ger. The entry points erand48, nrand48, and jrand48 can be called 
without first having to invoke a special initialization entry point. Calls 
to the entry points drand48, lrand48, and mrand48 should be preceded 
by at least one call to one of the initialization entry points—either 
srand48, seed48, or lcong48. 


3.1 Function drand48 


Example C usage: 
double fnext, drand48( ); 
fnext = drand48( ); 

The next pseudo-random number is returned as a non-negative 
binary fraction in double precision floating-point format; i.e., the value 
returned is 2~“°X;. The values returned are uniformly distributed over 
the interval [0, 1). Thus, in the above C-code example, the value range 
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for fnext is 0 = fnext < 1. Either srand48, seed48, or lcong48 should be 
invoked before calling drand48. 


3.2 Function lrand48 


Example C usage: 
long int Inext, lrand48( ); 
Inext = lrand48( ); 

The next pseudo-random number is returned as a non-negative 
integer in long integer format. The long integer is formed by taking 
the leftmost 31 bits of X;, ie., the value returned is |2~'’X;]. In the 
above C-code example, the value range for Inext is 0 < Inext < 2”. 
Hither srand48, seed48, or lcong48 should be invoked before calling 
lrand48. 


3.3 Function mrand48 


Example C usage: 
long int mnext, mrand48( ); 
mnext = mrand48( ); 

The next pseudo-random number is returned as a signed integer in 
long integer format. The long integer is formed by taking the leftmost 
32 bits of X; to be a signed integer in two’s-complement format. Hence, 
the leftmost bit of X; determines the sign of the output value. In the 
above C-code example, the value range for mnext is —2°? < mnext < 
21, Kither srand48, seed48, or lcong48 should be invoked before calling 
mrand48. 


3.4 Functions erand48, nrand48, and jrand48 


These functions are identical to drand48, lrand48, and mrand48, 
respectively, in the characteristics of the data value returned. The 
difference is that erand48, nrand48, and jrand48 allow, and require, 
the calling program to provide the storage for the current 48-bit X; 
value, while drand48, lrand48, and mrand48 provide this storage inter- 
nally for themselves. For those programs that require only a single 
stream of pseudo-random numbers, drand48, lrand48, and mrand48 
are a little more convenient and simpler to use. However, erand48, 
nrand48, and jrand48 allow multiple “independent” streams of pseudo- 
random numbers to be generated, i.e., subsequent numbers in each 
stream will not depend upon how many times the routines are called 
by other parts of a program to generate numbers for other streams. 
This property can be a big asset for certain statistical computations 
and for program debugging. 


3.4.1 Function erand48 


Example C usage: 
double fnext, erand48( ); 
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short int xsubi[3]; 
fnext = erand48(xsubi); 


3.4.2 Function nrand48 


Example C usage: 
long int Inext, nrand48( ); 
short int xsubi[3]; 
Inext = nrand48(xsubi); 


3.4.3 Function jrand48 


Example C usage: 
long int mnext, jrand48( ); 
short int xsubi[3]; 
mnext = jrand48(xsubi); 


3.5 Function srand48 


Example C usage: 
long int seedval; 
seedval = 0x1234ABCD; 
srand48(seedval); 

This is an initialization entry point that sets the value of Xo; the 
multiplier a and the addend ¢ are set to the values specified in Section 
I. The leftmost 32 bits of Xo are taken from the argument passed to 
srand48 when it is called (seedval in the above C-code example). The 
rightmost 16 bits of Xo are arbitrarily set to 330E,,.. Hence, the above 
C-code example sets the value of Xo to 1234ABCD330Ehig¢. 


3.6 Function seed48 


Example C usage: 
short int seed16v[3], *shp, *seed48( ); 
seed16v[0] = 0x330E; 
seed16v[1] = 0xABCD; 
seed16v[2] = 0x1234; 
shp = seed48(seed16v); /* pointer to previous X; stored in shp */ 
/* or alternatively, */ 
seed48(seed16v); /* pointer to previous X; just ignored */ 

This is an initialization entry point that sets the value of Xo to the 
48 bits specified by the argument passed to seed48; the multiplier a 
and the addend c are set to the values specified in Section I. In 
addition, the previous value of X; is automatically stored in a 48-bit 
internal buffer, used only by seed48, and the value returned is a pointer 
to this buffer. The argument is an array of three 16-bit integers. The 
above C-code example sets the value of Xo to 1234ABCD330Eis. 

The pointer to the previous value of x; is useful if a restart from that 
point is desired at a later time. The 48-bit X; value must be copied out 
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of the internal buffer before seed48 is called again or it will be 
destroyed. The following code sequence, for example, restarts a pro- 
gram with a saved value of X;. 

short int newx[3], oldx[3], *shp, *seed48( ), 7; 

shp = seed48(newx); /* initialize with whatever is in newx */ 

for (i = 0; i < 3; i++) oldx[z] = shp[z]; /* save previous X; in 
oldx */ 


seed48(oldx); /* reinitialize with oldx */ 
3.7 Function Icong48 


Example C usage: 
short int param[7]; 
param[0] = 0x330E; param[1] = OxABCD; parem [2] = 0x1234; 
param[3] = O0xE66D; param[4] = 0xDEEC; param[5] = 0x5; 
param[6] = OxB; 
lcong48(param); 

This is an initialization entry point that sets the values of Xo, a, and 
c; hence, different 48-bit linear congruential generators may be created 
by specifying different values for the multiplier a and the addend c. 
The argument passed to lcong48 is an array of seven 16-bit integers. 
The first three specify a 48-bit value of Xo; the next three specify a 48- 
bit value of the multiplier a, and the last one specifies a 16-bit value of 
the addend c. Hence, the above C-code examples set Xo) = 
1234ABCD330Eie, a= 5DEECE66S 6, and c = Bie. 


IV. TIMING RESULTS 


Table III presents the time required to generate, using function 
drand48, 10° pseudo-random numbers on five different computer hard- 
ware configurations. More precisely, Table III gives the time required 
to execute the following short C-language program: 

main( ) { 
register int 2, 7, h; 


Table III—Time required to generate 10° 
pseudo-random numbers 

Computer Time (sec) 
PDP-11/45 440 
PDP-11/45 with Fabritek cache 340 
PDP-11/70 162 
VAX-11/750 200 
VAX-11/780 96 
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double nfd, drand48( ); 
int lh, 1); 
short int nn[500]; 
long int seedval; 
seedval = 0x1234ABCD; srand48(seedval); 
for(i = 0; i < 500; i++) nn[z] = 0; 
li = 1000; lj = 1000; 
nfd = 500; 
for(t = 0; 1 < li; ++) for(j = 0; 7 < lly; j++) 
{h = nfd*drand48( ); 
nn(h)++; 


} 


Execution of drand48 accounts for 80 percent, approximately, of the 
times listed in Table III. For the timing tests on the PDP-11/45 and 
PDP-11/70, our version of drand48 that employs the floating-point 
arithmetic instructions was used, since it is substantially faster than 
the version that employs the integer arithmetic instructions. The first 
three entries in Table III represent separate executions of the same 
binary machine code; hence, the time differences reflect brute-force 
speed differences of the processor and/or memory hardware. The same 
is true for the last two entries in Table II]. The comparison between 
the PDP-11 and the VAX-11 is, however, more subtle since a complete 
recoding of drand48 is involved here. Essentially, when comparing the 
VAX-11 with the PDP-11 in Table III, we are comparing the time 
required to accomplish the identical “function” on both computers 
using a near optimally coded version of drand48 on each machine. The 
faster time for the VAX-11/780 must be attributed to at least two 
factors: a more powerful and capable instruction set than the PDP-11, 
and faster basic hardware. 


V. DISCUSSION 


We have described the design of drand48, a good, pseudo-random 
number-generator subroutine, and presented enough output data so 
that this subroutine can be recoded for a new processor and quickly 
tested for bugs. While the reproduction by a new drand48 implemen- 
tation of either Table I or II surely is not a logically complete test of 
correctness, it is, at least, a first-order indication that will catch many 
of the common types of programming errors. The drand48 subroutine 
is not a panacea, an “ultimate” generator for all purposes. There are 
numerous other algorithms for generating pseudo-random sequences, 
and each one has its advantages, disadvantages, advocates, and de- 
tractors. The most sophisticated requirements will always have to be 
met by custom design of the generator and extensive statistical testing. 
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However, it is not at these most sophisticated users that drand48 is 
aimed. 

We live in an age when new computers are coming into existence 
very rapidly. The UNIX operating system and the C programming 
language are already running on the Western Electric 3B20 processor, 
and the 3B05 is not far behind. Other new processors have been or are 
being introduced by other manufacturers. As existing programs mi- 
grate to these new machines, it would be desirable for the associated 
pseudo-random number generator to continue to produce the same 
output sequences. A subroutine written in a portable, high-level pro- 
gramming language is one possible solution, and drand48 has proven 
itself amenable to such an approach. However, for applications in 
which speed is of paramount importance, subroutines written in assem- 
bly language are useful. (The implementation of drand48 in the 
“portable” dialect of C ran significantly slower’ than the assembly 
language versions described in this paper.) The drand48 subroutine 
has worked well on the PDP-11 and VAX-11 computers; the author 
hopes that the information provided in this paper will make it possible 
for new implementations of drand48 and its variants to be accurately 
coded for at least some of the new computers that will ultimately 
become popular in the future. 
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Some Extremal Markov Chains 


By J. E. MAZO 
(Manuscript received March 10, 1982) 


Using a Markov chain model for the motion of a particle through 
a V-node network, we consider the quantities nj, which are the 
average number of steps taken by the particle in traveling from an 
originating node, i, to a destination node, j. A figure-of-merit, N, for 
the entire network is introduced by averaging nj over i and j. We 
investigate which networks minimize or maximize N, either when no 
restriction is placed on the Markov chain, or when we restrict it so 
that it corresponds to random routing. By the latter we mean that at 
each node the particle “selects at random” lines from an undirected 
network graph. We show that for random routing, the complete graph 
has N = (V — 1) and ts the minimizing graph. The maximizing graph 
is unknown, but we establish that the worst behavior of N increases 
at least with the cube of the number of vertices, but no worse than the 
3.5 power. Properties of the class of graphs known as barbells are 
useful here. The minimizing unrestricted chain corresponds to plac- 
ing the nodes on a circle and proceeding unidirectionally from one 
node to the next. Here, N = V/2. 


I. INTRODUCTION AND RESULTS 


Our work is set against the background of the Markov chain model 
for the movement of a “particle” or “message” through a network of 
V nodes. Thus, suppose a particle originates at node z and is destined 
for node 7 ¥ i. The particle wanders through the network toward its 
destination via a Markov chain, going to node n from node m with 
probability pn. The quantity Dmn is the (m, n) element of the transition 
matrix P of the Markov chain whose states correspond to the nodes of 
the network. Denote the average number of steps required by the 
particle to reach its destination by ny, and assume that any node is 
accessible from any other (the Markov chain is irreducible). Then we 
introduce the figure-of-merit, N, for any such chain by averaging nj 
over the (V — 1) possible destinations and V origins of particles: 
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1 VV 
N WV) 22 ny. (1) 
jet 
Our problem is: Which chains minimize or maximize N, either for a 
random-walk Markov chain or for an unrestricted chain? 

An unrestricted chain means that no restrictions are placed on the 
transition matrix P other than irreducibility and the requirement that 
pi = 0 for all 1. The random-walk chain is a special case of interest and 
is defined as follows. Draw any connected undirected graph on the V 
nodes. If m and n are not joined by an edge of this graph set Dmn = 0, 
while if they are so joined set Dmn = 1/Im, where 1, is the number of 
edges of the graph leaving node m. Thus, at any node, the particle 
chooses from the available lines “at random.” This random walk on 
the graph has previously been used by Kleinrock in Ref. 1, where it is 
referred to as random routing. 

Our results are: 

(t) For random routing, the average number of steps N is mini- 
mized when the graph of the network is the complete graph. For this 
case N= (V— 1). 

(tt) For the unrestricted chain, N is minimized when the nodes are 
placed consecutively on a circle and we proceed deterministically from 
1 to 2 to 3, etc. Here N = V/2. 

(iit) By choosing pis = pai = 1 — €, e > 0, N can be arbitrarily large 
for the unrestricted chain (when V > 2). 

(itv) We have not been able to determine the “worst” graph for 
random routing, but we can show for large V that O(V*) <= Nworst S 
O(V*°). The barbell graphs of Mitra-Weiss” and Landau-Odlyzko® are 
good candidates for bad graphs. 


ll. THE MINIMAL WALK 


In this section, we demonstrate that the complete graph is the only 
graph which minimizes N for the random-walk problem. 

The fact that the symmetry of the complete graph requires n; = N 
for all 1 47 allows a simple demonstration of the fact N = (V — 1) for 
this case. If the particle originates at node i, we go directly to our 
destination j with probability 1/(V — 1), requiring only one step, or we 
go to another node with probability (V — 2)/(V — 1) and then require 
an average of (1 + N) steps to reach j from i. Thus, 

1 V-2 
N= Fat yoy tA. (2) 
Solving (2) yields N = (V — 1). 
To show N > (V — 1) for other graphs is more involved. We first 
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give properties of those transition matrices that correspond to random 
routing, ending with (12) which gives the stationary probabilities for 
those chains. We then derive (24), which is an expression for the 
average of the first passage times with which we are concerned. Finally, 
we obtain our result by giving a lower bound to (24). 

Standard results on Markov chains or positive matrices may be used 
without reference when needed. For the former, the reader may consult 
Ref. 4, while Ref. 6 is a useful source for matrices. 

In this paper, we denote transposition, complex conjugation, and 
hermitian conjugate by the symbols 7, *, and ', respectively. 

For random routing, the transition matrix P is given by 


P=DA, (3) 


where A is the symmetric adjacency matrix of the graph and is defined 
by 
1 if i 47 and i and / are joined 
a; = by an edge 
0 otherwise, 


and D is a diagonal matrix with diagonal elements 
dis = 1/ki, (4) 


l; being the ith row sum of A and also, therefore, the number of edges 
of the graph incident on node i. The matrix P is stochastic, that is, it 
has nonnegative elements and the rows sum to one. Further, the 
assumed connectivity of the network graph implies that P is irreduci- 
ble. The facts that P is stochastic and irreducible imply that the largest 
positive eigenvalue of P is unity and has multiplicity one. All other 
eigenvalues of P have modulus less than, or equal to, one. Set A; = 1 
and let Az, ---, Av be the remaining eigenvalues of P. 

We now investigate the eigenvalue and eigenvector structure of P. 
The matrices P = DA and Q = D'”AD"™ differ by a similarity 
transformation (P = D’’?QD~‘”) and so have the same eigenvalues. 
Since Q is real symmetric, the A; are real. Further Q has a complete set 
of orthonormal eigenvectors o” 


Qo” = dp” 
pig = 5;. (5) 


If we denote the eigenvectors of P and P ? which correspond to A; by 
U” and W”, respectively, then we clearly have 


U%= ai D6 
WwW = a7 (D724 " 
WeTUY = 5,. (6) 
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The a; in (6) are any convenient constants. In addition, we have the 
spectral representation 


V 
P= Y¥ )\UOwer, (7) 
i=1 


Since the rows of P sum to unity, the eigenvector U™ may be chosen 
to be 


— 


UM = ae (8) 


The eigenvector W™, 
PTW = WwW” (9) 
is then the stationary probability vector for the Markov chain 


Pi 
P2 
Ww = : (10) 


Pv 


obeying the normalization of (6). 
Equation (9) is easy to solve when P corresponds to random routing. 
Define a vector Y by (Y); = 1; so DY = U™. Then, using (3), 


PTY = ADY = AU" =Y. (11) 

Thus, the stationary probability vector for the chain has components 
l; 

i=—>0, 12 

Pi= oF (12) 


where L equals the number of edges in the graph. Using (12) in (6), we 
find 


(6); = Vpi. (13) 

We next derive the expression (24) for N that will be our point of 

departure. Let f(7), n = 1, 2, --- be the first passage probability from 

node i to node/ at time n. Then, if P” is the nth power of the transition 
matrix P, we have (Feller, in Ref. 4, p. 352) 


fil) = Py (14) 


fila) =(P")y— YP sfyln—B), n= 2. 
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In terms of the generating functions, defined for |s| < 1 by 


Fi(s) = 2 fiy(n)s” (15) 
Py(s) = ¥ (P”)ys", (16) 
n=1 
(14) is equivalent to 
Pi 
Fy(s) ee Is| <1. (17) 
JJ 


Since all eigenvalues A; of P satisfy |Ai| < 1, (16) shows that (17) may 
be rewritten for 1 #7 as 
(-sPlj iFj 
Fy; (s) = {I — sP]j’ [s| < 1. (18) 
Denoting the ith component of U™ by US”, and similarly for W™, 
set 


UY WY = bY (19) 
and note from (8) and (10) that 
by = By. (20) 


Then, using the spectral a (7), (18) may be written 








—21—sA 
Fij(s) = 2 ee (21) 
;+(1l—s bY 
Dj + ( ) x T—aA, # 


In this form, F’;(s) may be analytically continued to a neighborhood of 
s = 1. Then, from (15) and a standard Abelian theorem we have 
dF ,;(s = 1) 


ds 22) 


ny = Y nfy(n) = 
n=1 
Using (21) to (22), we rae 


ny == Sy bY — BY). (23) 





Finally, using the definition (1) and relations (6) and (19), we see that 
the average number of steps, N, required to reach a destination with 
random routing for V nodes may be written 

2 

| (24) 
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(nu) 


vb; 








N= 1 . 1 ea ti o@pP— oy 
Va=-Lol=<A, J=1 Di ‘ 


oY 





Again, p; are the stationary probabilities of the Markov chain governed 
by transition matrix P = DA. The i,, » = 2, are the nonunity 
eigenvalues of P and Q = D’?AD"”, the latter matrix having ortho- 
normal eigenvectors ¢”). Also 6”, the eigenvector of Q associated with 
the eigenvalue ); = 1, is given by (13). 

We now lower bound the right member of (24). A few known facts 
about the quantities 4,, ¢™, and p; will be exploited, but their great 
interdependence (they are all determined by P) will be ignored. 

Set 











J= Dj Vv J Vp; 
and rewrite (24) as 
1 VY a 
N=—_—— fs, 26 
V-l.21-A, (26) 


We assert a, > 0, since, by Cauchy’s inequality, 


oi 2 + ( lo we) ( )- [oe 2 
— == 1 : (27) 
j Vp; Vv * Pj * aD Pj 


However, equality can hold only if 


(u) 
Ps 


vbi 


1 








Vv 


x], 


that is, if 
aio vp = 0. 


Since 6 is orthogonal to ¢” for p = 2, this cannot happen. 

With the positivity of the a, established, minimize (26) over the A, 
holding the a, > 0 fixed. During this minimization, we only impose 
two constraints on the \. First, A, < 1, and second }\)~2 A, = —1. The 
first has been amply discussed already while the second follows from 
the fact that the diagonal elements of the transition matrix P are zero 
and so 


Vv 
O=trP=Y A. (28) 
p=l1 


Since A, = 1, the second constraint follows. Introducing the second 
constraint via a Lagrange multiplier (call it —8), we find that the 
unique stationary point of 


t Incidentally, if we write ny = >, 7!?/(1 — A,) it is not necessary that rf” = 0. 
However, ry) + 7)? 
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Vv 
Yay 7 BE (29) 


p=2 


satisfying the two constraints, occurs when 





V 
A.=1- Va, | +> . (30) 
x Va, 
p=2 
Therefore, applying (30) to (26) gives 
1 r ; 
N= Va, ) . 3 
VV—1) wo (k wn) oe 


Next, use the fact that the arithmetic mean exceeds the geometric 
mean: 


V-1 
ae V Ces 3 vm) 


= (1 fa)" i -—> (11 a) (32) 


Motivated by the definition of the a, [see (25)], introduce the real 
symmetric matrix M having elements 





1 1 1 
M; = a by — Va. (33) 
Then, 
a, = 6 *Mgo™. (34) 
Note that 
me” = ¥ (4-52) vpn, (35) 
j=1 \Bi Vv Vip; 


while we have already shown 
g'Mo>0 if (¢,¢%) =0. (36) 
Thus, M is positive semi-definite, having precisely one zero eigenvalue. 


Fix p; > 0,1 = 1, ---, V, and proceed to lower bound (32) by writing 
Vv Vv 
I] a. 2 min [] YtMy”, (37) 
p=2 


p=2 


the minimum in (37) being taken over all orthonormal sets of vectors 
Y” which satisfy 
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WW) = by 
(Y, 6) = 0, fl, v= 2, +++, Vz (38) 


An inequality of Ky Fan° implies that under the conditions of (38) we 
have 


Vv 
min [J Y?'My” = pape, - ++, pvr, (39) 


v=2 


where p;,1 = 1, ---, V— 1 are the nonzero eigenvalues of the matrix M, 
which was defined in (33). Thus, 


1 


V-1 — 
N= yO [uipe +++ py-a]”?. (40) 


If g; are the components of an eigenvector of M associated. with 
eigenvalue p, the equation 


Vv 
x My 8; = ugi (41) 
= 
yields, when (33) is substituted for M,,, 
1 
gi=k Sar (42) 
si (5. #) 
Di 
k being a normalization On Since for p # 0 we must have 
0= 5 gal = ¥ abi (43) 
we obtain, using (42), 
is 1 
» iar rmenare it) (44) 


Go) 


Equation (44) determines the (V — 1) positive eigenvalues p;. Clearing 
fractions in (44) yields 


i eer | 
BMG *)- 5 
Ji 
with 
D= TI (=-n). (46) 
i=1 \ Di 


il é denominator D in (45) lay ve discarded and then the product or 
the nonzero eigenvalues of M is simply read off the remaining poly- 
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nomial. Since the coefficient of #”~! in the numerator polynomial in 
(45) is V(—1)""}, 


— 
ae 


Vv 
i (47) 


ji 


1 
Mises bv = a 
Dj 


Vi 


ae 


The product of positive numbers having a fixed sum is maximized 
when each number is the same, so we have 


V y-1 
(1 P| 
= 
pi Pi V- 1 
ini 
or 
LX yvait\ 
eva ess —— : 48 
[1 [y-1 ve (; =) (48) 
Finally, the minimum of (48) subject to ¥/4; p; = 1 occurs when p; = 
1 
y’ or, 


La PVely 


Es eh = — V-1 
peat eee Lv-1 = v ; _ 1 V . (49) 
V 
Combining (49) with (40) produces our desired result: 
N=Y (yryri a (v— 1, (50) 


It is easy to work backwards through the argument to see that the 
complete graph is the only one that can achieve N = (V — 1). Clearly, 
equality in the last step of the argument can only be attained if p; = 


+: Substituting this into (24) and using the orthonormality of the 


o¢” yields 
V-1" 1 
Ay ese ST es 
ee. (51) 
which only equals the minimum when A, = ay ph = 2, +++, V. 
Using this, and the fact that a =1 = (7) yields 
P=U9WT — (yyy) T 
U'wW o4 >) UW 
= UPwot + us Ow or 5 
=-y ) aaa Y (52) 
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The first dyad sum in the right member of (52) is the identity matrix, 
while the last dyad in (52) is the matrix that has all elements equal to 
1/V. Thus, P is precisely the transition matrix associated with the 
complete graph. 


lt. THE MINIMAL CHAIN 


Now we turn to finding the minimum value of N for the unrestricted 
chain. In the introduction we stated that for this we should place the 
nodes on a circle and go unidirectionally from one node to the next. 
Clearly, in this case, }; n does not depend on i, and, in fact, 


N : 5 ; [1+2+3+ + (V—-1)] y (53) 
= _ hny=-_ eee _ =—_—,. 
Voto: Vad 2 
We must now show that no other setup can do as well. For this end, 
expressions such as (23) are not useful since the A,, U“ and W“ may 
be complex and it would be difficult to pick out even real combinations, 
let alone positive ones that might be lower bounded. Rather, we retreat 
to an obvious generalization of the argument we used to derive (2). 
This generalization reads 


j= 1, ees, V 
ny = pi + x pir(1 + ny), re F (54) 
hyéj 
or, since P is stochastic, 
i, (= 1, «+>, V. 
ni — x DikNkj = 1, a i (55) 


hej 


Equation (55) is our new point of departure. 

Let NY, 7 = 1, ---, V, be the (V — 1) dimensional vector whose 
components are nj, with j fixed and i ¥ j. Also let P(j), j = 1, ---, V, 
be the (V — 1) X (V — 1) matrix obtained by crossing out the jth row 
and the jth column of P. The Pi J) still have positive elements, but the 
irreducibility of P implies that not all rows of P(7) can sum to one, 
and so the largest eigenvalue of P( J) is strictly less than one. The 
equations represented by (55) may now be written 


[I -— P(j)IN™ = u, J=1,---,V, (56) 


where u is the (V — 1) dimensional vector having all components 
unity. From (56), 


NY = (1 - P(j)Ju= [I+ Pj) + Pj) +---Ju. (57) 
if P2( J) denotes the (7, s) element of the kth power of Pi J), then (i) 
and (57) yield 
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1 V V-1 ow 


NW ea ee) 
where by P°(j)) we mean the identity matrix for (V — 1) dimensions. 
Equation (58) succumbs to the application of the following: 

Lemma: Let P be any stochastic V x V matrix and let P(j), j = 
-, V be the (V— 1) X (V— 1) matrix obtained by crossing out the 
jth row and jth column of P. Then 


i 0 for k=0 
XL Pr) = eae for 0<k<V-2. 


J=1 r,s=1 


(59) 


Proof: The positivity of the sum for all & is trivial since Pi J) has 
nonnegative elements. If k = 0, P°(7) is the identity for all 7, so the 
sum of all its elements if (V — 1), and we obtain V(V — 1) when we 
sum over 7. Next consider k = 1 and make use of the stochastic nature 
of P, that is }'; Ps = 1. 


LLP (i) = % Pom dd (1 — Pry) 


=V(V-I)-YUPy+Y Pr = VV — 2). 


J r 


Now proceed by induction assuming that the lemma is true for k and 
show it true for (k + 1) 


V v-l 
2 >, PH) i 2 ay PryngProng +** Prysinnss 
J=1 rs=1 JO mr) 
Npyorh 
= » a PrynpProng ee Prarie 
J mA) 
ee 
ae > NN oe Pay 
J mAy 
fies 
SVV=b= 1ST So Pie R10) 


J Ny ,Ng-+- 


The inequality follows from the induction assumption (true for k) and 
is further strengthened by extending the range of summation on the 
negative term. In the negative term, the sums over J, 7z41, «++, M2 yield 
unity, and, finally, the sum over n, gives V. The conclusions of the 
lemma follow. 

Now apply the lemma to (59). 
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1 v-2 
ek 17 > bprz | 


k=0 JI ns 
V—-2 
MVR) => (61) 


= WV) ob = 


as desired. 

The condition for equality in (61) obviously requires that 
» 2, Pang 0 eB nj ~ = oe Pome 
Jj ny J MM: 

nyxti 


» Pay R=1+,V—1. (62) 


Since all terms on the right member of (62) are nonegative, any term 
present there but not appearing on the left must be zero. In particular, 
we must have for any J 


Py =0 if k=] (63) 
Y Ping +++ Pn,j = 0 k=2,---,V—-1. (64) 


These equations state that it is impossible to return to any initial state 
J in less than V steps. This, plus irreducibility, implies the unidirec- 
tional movement or a circle. 

The reason why the lemma is exact in this case is that each Pc J) is, 
then—except for a reordering of nodes—a canonical Jordan block with 
all zeros in the matrix except for (V — 2) ones on the appropriate off- 
diagonal. 


IV. THE WORST WALK 


We have been unable to describe the worst setup for random routing. 
For V = 3, 4, and 5, numerical work shows that arranging the vertices 
on a straight line gives the worst cases. In fact, for V nodes on a 
straight line it is possible to show that 


V?-1 
3 > 
which is significantly worse than (V — 1), the best attainable with 


random routing. However, for large V one can do worse than (65). Our 
result for this situation is 


O(V?) = Nworst = O(V*”). (66) 
In (66) Nworst denotes the largest value of N obtained from any of the 


connected network graphs on the V vertices. 


For the lower bound, assume V = 6m — 1 and consider the bar- 





N= (65) 
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Fig. 1—An 11-node barbell. 


bell graphs described by Landau-Odlyzko.® This class of graphs has 
(2m — 1) nodes connected in a straight line with complete graphs of 
2m nodes attached to each end of the line by a single edge of the graph. 
If m = 1, this only describes a straight line, but for m = 2 or greater, 
the barbell nature is evident. The case for V = 11 is shown in Fig. 1. 


In Ref. 3 the authors show that for such graphs do, the second largest 
eigenvalue of P = DA, satisfies 


Y 





Ag 2=1-—- vy (67) 
with y = 54+ O (5). Then from (26), (67), and (34), 
1 a2 2) (tag 4 (2) 
=F -i1L 5, = Oe Mo 
> O(V?)uy-1, (68) 


piv-1 being the smallest nonzero eigenvalue of M. Equation (44) shows 


(69) 





v1 > . 
max Di 


Since the nodes of the complete graphs in the barbell each have O(V) 
incident edges and there are O(V’) edges in the barbell, (12) implies 


1 
that max p; = O (5). Equations (68) and (69) then give 


N= O(V*)O(V) = O(V’). (70) 


Although we will not give the details here, it is possible to show that, 
in fact, N = O(V’) for barbell graphs. 

Our upper bound will be based on (57), but first we need the result 
that the largest eigenvalue of P(/) satisifes 


NG) = MIP ietS0 (a): (71) 


Just as the matrix Pi J) was defined in Section III, now introduce 
matrices D(j) and A(j) by eliminating the jth row and column of D 
and A, respectively. Then, 


P(j) = D(j)A(V). (72) 
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Also set 
OV) = D?(jAV)D(V) (73) 


so that (7) is symmetric and has the same eigenvalues as P(/’). In 
fact, we have 


MLP(7)] = AfQ(7)] = max y'Q()y 


= max z'A(j)z= max xtAx. (74) 
Vhze1 x;=0 
Shxt= 
Note that in (74) we have returned to the V X V adjacency matrix A 
by introducing the constraint x; = 0. If S is the set of those (ordered) 
pairs of indices which corresponds to vertices connected by an edge of 
the graph, we have! 


1 
XTAX= Yoo xXmitn = = Vi [x2 + x2 — (Xm — Xn)*] 
S 


(m,n)ES 2 
2 1 2 
=P InXm — = ¥; (Xm — Xn)”. (75) 
m 2 Ss 
Using (75) in (74) we easily obtain 
; ‘oe 
Aj) =1—-—= min Y (xm — xn)’. (76) 
2 x;=0 S 
Yiix?=1 


Let x; be the component of x having the largest square. Then 


1 =) dix? < (V— 1lmaxXk, (77) 
or 
ee (78) 
= lmax(V — 1)’ 


where Imax (or Jmin) is the maximum (or minimum) of the J, i = 
1, ---, V. 

Now, from the connectivity of the graph, vertex j is attached to 
another vertex ¢. So, using (76), 


Aj) S1— x?, (79) 
since x; = 0. If ¢ = k, (78) and (79) yield 


, 1 
AJ) S Py (¢=k). (80) 


If t ~ k, there exists a chain of r distinct edges joing vertices ¢ and k. 


t The remainder of this demonstration is entirely inspired by Ref. 3. 
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Clearly, r = d, where d is the diameter of the graph. Then, using the 
basic trick of Ref. 3, 


Xk — Xe = (Xp — Xn.) + (Xn, — XK.) + eee + (XK, — Xe) (81) 


and so, by Cauchy’s inequality, 


rl Q 


(x, -— x)’ s 5 » (Xm — Xn)? S x (Xm — Xn)”. (82) 


Equations (76) and (82) thus yield 
(xz — xt)? 
d 


Combine (79) and (83) by averaging and then minimize over the 
numerical value of x; to obtain, with (78), for t ¥ k, 


Ay) S1— (83) 


i 1 2 (Xk = x2)? eet 1 
Ai(J) =1 3 E aoe =1 21 + d(V— DImax (84) 


Both cases (80) and (84) are included in 

1 1 
a | -— 
2(1 + d)(V — 1) Imax 2V”’ 
since d = (V — 1) and Imax = (V — 1). 


To complete the upper bound on N, start with (58) and write 
(suppressing the j-dependence ss the matrices in the notation) 


A(j) S1- (85) 


1 
N= 5 Ewe Pu) 
__ ler ; 
mv 5 DM - PTI 


rs 7 | D[1 ms Q)"D? | 


=p Dv*|| ||D-? || |Z - OT" | 
his: 


< 1 


From Jmin = 1, lmax = (V — 1), (85) and (86) we thus have 
Ns2V°VV-1, (87) 
as desired. 
Is it possible that the complete graph is optimal for random routing 
because adding any edge to a graph decreases N? No. Many edges 


must be added to the straight-line graph to form the barbell, but the 
former has N = O(V’) while for the latter, N = O(V°). 
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Quality Evaluation Plan Using Adaptive Kalman 
Filtering 
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(Manuscript received March 5, 1982) 


An important function of the Bell Laboratories Quality Assurance 
Center and the Western Electric Quality Assurance Directorate is to 
audit the quality of the products manufactured and the services 
provided by the Western Electric Company to determine if the in- 
tended quality standards are met. Until the sixth period of 1980, the 
t-rate system was used to make inference on the product quality. 
Starting the seventh period of 1980, the Quality Measurement Plan 
(QMP) has been implemented. The QmP is based on an empirical Bayes 
model of the audit-sampling process using the current and the pre- 
ceding five periods of data. Because it ignores the time order of the 
data, it is slow in responding to drifts in the process mean. The — 
Quality Evaluation Plan (QEP) has been designed to take into account 
the time order of the data and to be more sensitive to drifts in the 
process mean. In this paper we present the Quality Evaluation Plan, 
which uses the entire time series of data on a given product to 
determine if that product meets the quality standard. The time series 
is modeled by a stochastic process, which allows for the possibility 
that the process mean may drift or fluctuate around a fixed value. An 
adaptive Kalman filtering theory is developed for filtering out the 
sampling variance and obtaining the best estimate of the true defect 
index and its confidence interval. Thus, in QEP the best estimate of 
the true defect index is obtained by a combination of adaptive 
exponential smoothing and shrinkage to the mean. The QEP compu- 
tations are recursive, and the total computing efforts of Q9EP and QMP 
are roughly equal. The paper contains several examples to illustrate 
the QEP. 


l. INTRODUCTION 


An important function of the Bell Laboratories Quality Assurance 
Center and the Western Electric Quality Assurance Directorate is to 
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audit the quality of the products manufactured, and the services 
provided by the Western Electric Company to determine if the in- 
tended quality standards are met. This is achieved by dividing the 
products and services into some 3000 homogeneous classes. A small 
sample is taken from each class during each period (there are eight 
rating periods in a year). Based on this data, an inference is made in 
each period regarding the compliance of each class to the quality 
standard. 

Until the sixth period of 1980, the ¢-rate system, evolved from the 
work of Dodge and others,’ was used to rate the product quality. 
Starting with the seventh period of 1980, the Quality Measurement 
Plan (QMP) was implemented. The QP, developed by A. B. Hoadley,” 
is based on an empirical Bayes model of the audit-sampling process. It 
uses the current and the preceding five periods of data. It represents 
a considerable improvement in the statistical power for detecting 
substandard quality as compared with the old rules based on the f- 
rate. However, QMP ignores the time order of the observations, so it is 
less sensitive to drifts in the process mean. The Quality Evaluation 
Plan (QEP) has been designed to take into account the time order of 
the data and to be more sensitive to drifts in the process mean. 

The object of this paper is to present the Quality Evaluation Plan, 
which uses the entire time series of data on a given class to determine 
if that class meets the quality standard. The time series is modeled by 
a stochastic process, which allows for the possibility of the process 
mean to (i) drift or (ii) fluctuate around a fixed value. An adaptive 
Kalman filtering theory is developed for filtering out the sampling 
variance and obtaining the best estimate of the true defect index and 
its confidence interval. Some of the salient features of QEP are: (i) the 
best estimate of the defect index is obtained by an adaptive exponential 
smoothing process, making QEP more responsive to shifts and drifts in 
the process mean; (iz) the QMP model is a special case of the general 
model proposed here; and (iii) the computational method is recursive. 

This paper is divided into nine sections. We describe the model in 
Section II. Section III gives the Kalman filter solution of the model. 
Adaptive estimates of the model parameters are developed in Section 
IV, and in Section V we modify the Kalman filter solution of Section 
III to reflect the fact that the model parameters are estimated. The 
solution thus obtained is the adaptive Kalman filter. The construction 
of the box chart for displaying the results, and the rules for the 
exception report are spelled out in Section VI. The selection of the 
starting values for the estimation of the model parameters and the 
adaptive Kalman filter solution is discussed in Section VII. The 
algorithm has been tried on a number of rating classes and also on 
simulated data. We present representative examples in Section VIII. 
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Some numerical comparison between QEP and QP is also presented in 
that section. Finally, in Section IX, we discuss the features of the QEP 
and other potential applications of the adaptive Kalman filtering 
methodology developed in this paper. A summary of the QEP formulae 
is given in Appendix C. 

Parts of the derivation of QEP are heuristic. The heuristic has a 
sound theoretical foundation under two assumptions: (i) the audit 
sample size for a rating class does not vary in time by orders of 
magnitude, and (ii) the maximum likelihood estimates of the time 
series parameters fall within their feasible region. These assumptions 
are satisfied in about 95 percent of the audit examples. QEP appears to 
work for the other 5 percent as well, but this has not been fully tested. 


Il. DESCRIPTION OF THE MODEL 


Let @ denote the true defect index in period ¢ for the particular 
rating class under study. Thus, 


Total number of defects present in 
the production of period t 
A, = ——---> 
Total number of defects allowed in 
that production under the quality standard 


In deriving the present gmp, Bruce Hoadley” assumed that over a time 
window of six periods the successive values of @; are independently and 
identically distributed around a fixed mean, called the long-term 
process mean. Consequently, the time order of the past observations 
is ignored in estimating the current defect index. Hence, QMP responds 
to a drift in the defect index only through having the moving window, 
which means a slow response. In our model we will overcome this 
deficiency by explicitly allowing for drift and serial correlation. 

The mathematical analysis of serially correlated data is greatly 
simplified when the random variables involved are normally distrib- 
uted. The audit problem can be put in this framework by the square 
root transformation described in the following paragraph. 

For the chosen sample, let e; be the expected number of defects 
under standard quality, x; be the observed number of defects, and 
I, = x:/e: be the observed defect index; then x; has a Poisson distribu- 
tion with mean e,6;. It is well known that the distribution of Vx; can be 
approximated by the Gaussian density with mean Ve,6, and variance 
%, Let Y; = VI,. The distribution of Y; is approximately normal with 
mean V6, and variance 0.25/e;. We shall denote V6, by §; and refer to 
it as the transformed true defect index, or simply as the true defect 
index. 
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When the observations are defined in terms of demerits or defectives 
we will take x; and e, equal to the observed and the expected equivalent 
defects, respectively, as defined by Hoadley.’ In this case the distri- 
bution of x; is approximately Poisson with mean @,e;, so we can still 
use the square root transform defined in the previous paragraph. 

Autoregressive-integrated-moving average models with appropriate 
trend terms may be used to characterize a wide range of serial corre- 
lations and trends. However, since the available data on each product 
is limited, it is essential that we keep the structure simple, involving 
only a few parameters. Thus, we propose the following model for the 
variation of ¢;: 


& = me + Pu, . (1) 


where m; is the trend term (including the mean) and 7; is the deviation 
from the trend. The successive values of 1; will be assumed to be 
independently distributed with zero mean and variance o%;. 

Since the exact nature of the drift is not known to begin with, we 
shall assume that m; is a random walk. We found in control engineering 
literature‘ that the random walk model serves well in tracking a variety 
of trends in unknown parameters, and therefore we chose to use it in 
the present problem. Thus, 


Me: = Met-1 + Pat, (2) 


where p2; is a sequence of independently distributed random variables 
with mean zero and variance o3;. Further, the sequence vz will be 
assumed to be independent of the sequence v1, 

Equations (1) and (2) thus characterize the variation of the defect 
index—the component m;, describes the low-frequency (smooth) 
changes, while vy describes the high-frequency changes in ¢. If we 
take o3, = 0 and oj; = oj = constant, then these equations imply that 
the {7s and hence @,’s are independently and identically distributed. 
Thus, the QMP model is a special case of the general model of this 
paper. 

The transformed observed defect index, Y;, is the transformed true 
defect index plus the sampling error, 7. Thus, 


= 0+. (3) 


As discussed earlier, the expected value of Y; is {; and the variance of 
Y; is 0.25/e:, so 7 has zero mean and its variance is equal to 0.25/e;. 
We assume that the successive random variables 7; are independent. 
Also, since the origins of 7, vi, and v2; are unrelated, we assume that 
these three series are mutually uncrosscorrelated. Further, the distri- 
butions of 71, v2, and y: are assumed to be normal. The justification 
for this assumption comes from the fact that Y?s are approximately 


normally distributed. 
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The problem at hand is to make an inference on @, given data up to 
and including the nth time period. In particular, we wish to determine 
the posterior probability of the event that 0, exceeds one. 


lll. KALMAN FILTER SOLUTION 


In the Kalman filter terminology, ¢, and m, are the unobserved state 
variables about which we wish to make inference using the observa- 
tions Yi, ---, Yn. Let us, for now, assume that the model parameters 
ir, Oa, and o%, are known for ¢ = 1, --- n and that the means and the 
variances of { and mo are known. Then the Kalman filter provides 
recursive formulae for estimating the posterior means and variances of 
$, and m,. The derivation of the general Kalman filter may be found 
in a number of books (e.g., see Jazwinsky” or Gelb*). A simple deriva- 
tion for the special case of the audit model is given in Appendix A. 
The desired recursive formulae are given below. 

Conditional on the data up to time ¢, the distribution of m; is normal 
with mean m; and variance q:, 


i.e., m:| t~ N(me, qt), 


where MM, = Wott + (1 — wa) Y; (4) 
Qt = (1 — wx) (0% + o>) (5) 

2 4 2 
wont a (6) 


> ae 2 2 ° 
Or + One + O2¢ + Qe-1 


Likewise, conditional on the data up to time ¢, the distribution of ¢; 
is normal with mean ¢; and variance pz, 


1.e., ¢ a N(é, Pe), 


where c = WyWMe1 + (1 — wow) Ye (7) 
Pt= (1 = W2eW14) Ore (8) 

2 

oO 
W2tWit = Z (9) 


2 2 2 ; 
Ont + O12 + O2¢ + Qr-1 


To use these recursive equations the starting values mo and qo must 
be specified. The choice of these values is discussed in Section VII. 
For now, we note that as t > o, the effect of the starting values 
reduces to zero. 

Notice that eq. (4) is an adaptive, exponential smoothing equation. 
The smoothing constant, we, is a function of time and is determined 
by the relative values of the different variances as given by eq. (6). 
Observe that V(Y;|m:) = of + 0% so that of + o% measures the 
uncertainty in using Y; for estimating m;; also, ox = V(m:|m.-1) and 
gir-i = V(m-1|t — 1). Thus, o% + q:-1 is a measure of uncertainty in 
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using 7,-, for estimating m;. It is clear from eqs. (4) and (6) that the 
weights given to Y; and 77;-; are inversely proportional to their respec- 
tive uncertainties in estimating m. 

Equations (7) and (9) are the analogous equations for computing 
the posterior mean of {&;. Note that V(Y;|{-) = 0%: is the uncertainty 
in using Y; to estimate ¢. Further, V({&|m:1) = ot + 0% and 
V(m-1|t — 1) = qr-1 so that of; + o% + qr: is the uncertainty in using 
Mm-1 to estimate &. The weights on Y; and 7-1 are thus seen to be 
inversely proportional to the respective uncertainties. 

From eq. (8) we note that the posterior variance of ¢; conditional on 
data up to time ¢ is smaller than o%, by the factor (1 — w2;w1). Thus, 
the factor (1 — w2w1-) represents the advantage of filtering in estimat- 
ing ¢,. Similarly, from eq. (5) we see that the factor (1 — wz) is the 
benefit of filtering in estimating m:. 

To compare the QMP model given in Ref. 2 with the gEP model we 
shall rewrite eqs. (7) and (9) as follows: 


& = wy: + (1 — wu) Yr 
2 
Ont + Of 
Analogous to the Qmp, eq. (7) expresses % as a weighted sum of 7f;, the 
estimated current mean level, and Y;, the current observation. The 
weight w1; is analogous to the shrinkage constant given in Ref. 2, and 
we will also call it a shrinkage constant. 

The discussion of this section was based on the assumption that o7;, 
03, and o7, are known quantities. However, in the audit problem oj; 
and o3 are not known and must be estimated from the observed data. 
In the following section we will derive the estimates of of, and o%,, and 
in Section V we will modify eqs. (4) through (9) to accommodate the 
fact that oj, and 03, are estimated. 


IV. ESTIMATION OF THE MODEL PARAMETERS 


Consider the case where oi; = oj = constant, 03; = o3 = constant, 
and o7, = o7 = constant. 

Let us define Z, = Y; — Y;-1. Under the assumed model EF (Z,) = 0 
and the autocovariances of Z; are given by 


E(Z?) = 2o0{ + 03 + 20%, (10) 
E(Z.Z:-1) = —oj — 03, (11) 

and 
E(Z.Z:-) =0, 22. (12) 
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Thus, Z; is a first-order moving average [MA(1)] process of zero mean, 
i.e., Z; can be represented as 


Zt = Ar+ Bari, (13) 


where a; is a white noise series of variance o”. The parameters 6 and 
o” are related to oj, 03, and o; through the autocovariance function; 
L.eé., 


E(Z?) = (1 + B’)o? = 203 + 03 + 207, (14) 


and 
E(Z,Z:-1) = Bo? = —oj — 0°. (15) 
Solving eqs. (14) and (15) we have 
oi = —Bo* — a; (16) 
and 
o3 = (1+ B)’o”. (17) 


The nonnegativity of of and the invertibility of the model given by eq. 
(13) impose the following restriction on B: —1 < 8 = —o?/o”’. Thus, the 
feasible region for B and a” is the one enclosed by the lines: 8 = —1, 
Bo” = —o2, and o” = «, The region is shown in Fig. 1. 

The parameters 8 and o” can be estimated using a suitable time- 
series method. Once £ and o” are known, eqs. (16) and (17) may be 
used to estimate oj and o3. 

Contrary to the assumption made at the beginning of this section, 
Oi, 63, and o2, may in fact vary from period to period. Through eqs. 
(14) and (15) this implies that @ and o” may vary with time. In other 
words, Z; is like a MA(1) process with changing parameters. Therefore, 


B 
g2 


0 0.1 0.2 0.3 0.4 0.5 


o2=0,02 + Ba2=0-. 
x ! aa 
Y 
ALINE WITH 
of = CONSTANT 
FEASIBLE REGION 
-1 


-07 o7=0,B=-1 


Fig. 1—Feasible region for 8 and o”. 
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we need an adaptive, recursive estimation method for estimating 8 and 
o”, rather than the usual time series estimation methods. The recursive 
method of Phadke’ will therefore be used here. The method discounts 
the past data exponentially and thus can respond to changes in the 
model parameters. The necessary recursion formulae are given below. 
Appendix B gives the derivation of these formulae. 


Bi = Bo — Ri’, (18) 
6? = Si(Br)/Ar, (19) 
where 

Ve = AMR + 2a (20) 
R,.=ARe-1 + 2 (=) (21) 
ar = Z; — Boar-1 (22) 

da dar-1 
7 ee a Bo~ aa (23) 
Si(Bo) = ASi-1(Bo) + a? (24) 
Si(Be) = Se(Bo) + (Be — Bo)ve + ¥(Be — Bo)"Re (25) 
Ai = Ati + 1. (26) 


The choice of the starting values for these recursions will be studied 
in Section VII. The parameter A, 0 < A <= 1, determines how fast the 
old data is discounted in estimating the model parameters. A = 1 
implies that the entire past data is used. The smaller the A the faster 
the past data is discounted. 

The estimates 6? and £; are uncorrelated and have the following 
approximate variances: 


V(B:) = 26?/R; (27) 
V (62) = 264/A). (28) 


The estimated values of o” and B may be substituted in eqs. 
(16) and (17) to compute oj; and o3,. When on Varies with time, we 
should use the exponentially smoothed value, o;,:, as defined below: 


074 = Aozy-1 + (1 — A)or, (29) 
in place of o7 in eq. (16). Thus, 


AQ __ i A2 2 
Ou = —Bid% —~ On,ts 
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and 
6 = (1+ B,)°6?. 

In the rare case of extreme variations (order of magnitude variations) 
in o7,, eq. (29) has not been fully tested, so we urge caution for such 
cases. 

It is possible that eqs. (18) and (19) would yield infeasible values of 
B, and G7. In that case we propose the following truncation rules, which 
take us close to the maximum point on the feasible boundary. 

Step 1: Truncate f£; to the region [—1, 0]. Denote the truncated 
value by 87, where 

Bo cass v/R: if —1 = Bo — v./R: = 0 
Bi= 0 if Bo-—“/R:.>0 
—1 if Bo — 1 /Ri<—I1 


Step 2: Compute of? and 63;: 


= S(BP)/Ar= {S:(Bo) + (BF — Bo)ve + (Bi — Bo)?Ri}/At 
63 = (1 + BP)? of? 
* 2 


Step 3: If Bf and of a Pelong to the feasible region, 1, A e., if (—Bio; 
= a+) > 0 then Bi = Bi : o = of? ’ and o Gt = — 6? — Ont. 

Step 4: i Be and of” do not belong to the feasible region, i.e., 
if (Biot? — o2,) <0, then set 63, = 0. Compute f; and 6? by solving 
the following two equations: 


6: = —B.d? — 0% = 0, 
and 
63 = (1 + B,)?67 
The resulting 2, and 6? are given by 
4 _ —(2 + G/ore) + V(2 + G3:/o40) — 4 
5 pce i Ms Ne an 
2 
and 
Gi = — o/ Bi. 
Note that when these truncations are applied there will be a larger 
degree of approximation involved in using eqs. (27) and (28) for 
computing the variances of £; and 67. However, these variances enter 
only into the secondary terms of the adaptive Kalman filter to be 


derived in the next section. Consequently, we may ignore the effect of 
truncation. 
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V. ADAPTIVE KALMAN FILTER 


In deriving the Kalman filter solution of Section II, it was assumed 
that oj; and 03 are known quantities. Since in the audit problem these 
parameters are estimated using the observed data, we need to make 
due modifications to the Kalman filter solution. We will refer to the 
resulting formulae as the adaptive Kalman filter. 

Consider the distribution of m; conditional on data up to time ¢: 


= E(m,|t) = EE(mi\t, oi, 63) = E (wodite-1 + (L — wae) Ye); 


hence, 
My, = OnMu-1 + (1 — wee) Ve (30) 


where 
A2 2 
eens Ls (31) 
Oi + One + 02 + Qr-1 
The distribution of wy conditional on data up to time f¢ is very 
complicated. So the expected value of w2 cannot be simply computed. 
Therefore, in eq. (30) we have approximated E(w2|t) by x, the 
maximum posterior density point. This approximation would be very 
good when #; and 6? lie inside the feasible region. However, if 2, and 
6? lie on the boundary of the feasible region, és; will be a less accurate 
approximation of E (w2|t). The extent of the inaccuracy will depend 
on the values of R; and A;. For large R; and A; the likelihood of 8; and 
o; will drop very rapidly as one goes away from the feasible boundary 
nearest to the maximum likelihood point. Thus, the inaccuracy would 
be smaller for larger values of R; and A:. 
Now consider the variance of m; given data up to time f: 


= Vim:|t) = EV(m.|t, Oi, 031) + VE (m,| t, Ot; a3) 


= E[(1 — wee) (oi + oz) ] + V[wotitr1 + (1 — wor) Yr]; 
hence, 


Qe = (1 — War) (Gt + oF) + (Ye — r-1)?V (wr). (32) 


The effect of £, and 6? lying on the feasible boundary will be to 
introduce an inaccuracy in the term (1 — 2)(6i: + o7) of eq. (32), as 
discussed above. Knowing the variance of £; and o7, the variance of 
wo: can be derived via the Taylor series approximation as follows. We 
have 

papi 64 + Or 
= Git + Ont + 65; + Qt-1 
2 — Bio? — oF + on 
(1 + Bi + Bi)o7 + gi + On a Ont 
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~ bo + + (=) (B:— Br) + (=) (a? — 6?). 


Hence, to the first-order approximation the variance of wz»; is 


V (woe) = ( =) V(B:) + (x) V(62) 


= {267[1 + bx(1 + 2B.) P/R: 
+ 2648: + (1+ Be t+ B2)bx9?/Ad 
+ (Gtr + of + Ge + Gr). (33) 





Note that the smoothing constant ws; is restricted to the interval 
[0, 1]. The most noninformative distribution on this interval is the 
uniform distribution whose variance is “412. The @2; computed by eq. 
(31) clearly adheres to the interval [0, 1]. However, because of the 
approximations involved, the computed V(w2;) may come out larger 
than Ye. In that case we propose to truncate it to “e. 

When B: and 6; lie on the boundary of their feasible region, the use 
of the Taylor series approximation would yield inaccurate estimates of 
V (we). Since the contribution of this variance is secondary in comput- 
ing V(m,|t), we may ignore the effect of truncation. 

We can proceed in an analogous way to compute E({;|t) = % and 
V(f.|t) = pz to yield: 


& = Wor@1eMe-1 Se (1 a Gee@it) Ye, (34) 
where 
2 
A Ont 
has ee 35 
Wit oe + 62, ( ) 
and 
Pt= (1 —- (or6014) Ont + (Y;- Me-1)"V (woew1) ’ (36) 
where 


V(worw1) = [267 (1+ 2B1)753 003) /R: 
+ 264(1 + Br + B?)5363./Ac] 
+ (6% + On + 63, + qr-1)”. (37) 


For the reasons discussed in the case of V(w2), if eq. (37) yields a 
value of V (weenie) > Ye, then we will truncate it to “2. 

The effects of 8, and 67 lying on the feasible boundary are similar to 
those explained in connection with m; and q:. 
VI. BOX CHART AND THE EXCEPTION REPORT 


In Ref. 2 Bruce Hoadley has proposed a format for displaying the 


KALMAN FILTERING 2091 


conditional distribution of 6, given data up to time ¢. He has also 
proposed exception rules in terms of this distribution. We shall use the 
same reporting format and the exception rules. 

The conditional distribution of 6, will be summarized by a box chart 
that shows the 99, 95, 5, and 1 percentiles of the distribution, the best 
estimate of 6,, denoted by 6, the mean level, M,, and the current 
defect index J,. By applying the inverse square root transformation, 
we have 


M,=m? and 6,= &. 


The quantiles of @, are once again obtained by squaring the quantiles 
of ¢. Since ¢; is restricted to be positive, and we have approximated its 
density by the normal distribution, we may have to truncate some of 
the extreme quantiles to zero. If we take this fact into account, the 
desired quantiles of @, are: 


Q: = 99% quantile = [max(& — 2-326Vp,, 0)}? 
Q2 = 95% quantile = [max(& — 1-645 Vp, 0)]? 
Qs; = 5% quantile = (& + 1-645 Vp,)? 
Qs = 1% quantile = (£, + 2-326 Vp,)*. 


A sample box chart is shown in Fig. 2. 
The exception rules are: 
(t) Below Normal: A rating class will be declared below normal if 
the posterior probability of @, being larger than one exceeds 0.99. 
(zz) Alert: An alert will be declared for a rating class if the posterior 
probability of @, being greater than one exceeds 0.95 but it is less than 
or equal to 0.99. 








Q, 1% QUANTILE 


5% QUANTILE 


QUALITY STANDARD 


DEFECT INDEX 


95% QUANTILE 


Q, 99% QUANTILE 


ee ec a 


Fig. 2—Sample box chart of the conditional distribution of 9,. 
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So in terms of the quantiles derived above, we will declare below 
normal if Q; > 1 and alert if Qi = 1 but Q. > 1. 


Vil. CHOICE OF STARTING VALUES 


The Kalman filter solution described in Section III and the estima- 
tion of model parameters described in Section IV are both recursive 
procedures that must be appropriately initialized. Note that since each 
of these procedures discounts the past data, the effect of initialization 
diminishes to zero as more data is accumulated on any rating class. So 
any biases introduced by the initialization process are transient and 
temporary. The best way to choose the initial values is by analyzing 
the historical data on all rating classes. Pending such an analysis, we 
shall tentatively choose the initial parameter values, as follows. 

We will take 7% = 1.0 and go = 0.134. This amounts to choosing a 
very diffused prior distribution on the mean level. On the square-root 
defect-index scale the lower and the upper one percentiles of this 
distribution are 0.149 and 1.851, respectively; while on the defect-index 
scale the lower and the upper one percentiles are 0.022 and 3.428, 
respectively. The mean and the median of this distribution are equal 
to one on either scale. Consistent with this we will choose Yo = 1.0. 

The parameter o79 should be taken equal to the variance of the 
transformed defect index associated with the planned equivalent ex- 
pectancy for a period’s sample for the particular rating class. In the 
present analysis we will take e9 = e; and 679 = 070 = 0.25/eo. 

The parameter ’ determines how many periods of data are effec- 
tively used in estimating the time series parameters £; and o7 and, 
hence, the parameters oj, and o3;. We will take A = 0.95, which implies 
that effectively 1/(1 — A) = 20 periods of data are used in estimating 
the model parameters. 

We also need to specify the values of Bo, ao, dao/dB, So( Bo), eo, Ro, 
and Ao. All these variables enter into the recursive maximum likelihood 
estimation of 8 and o”. We shall take Bo = —0.6, which is an approximate 
midpoint of the feasible range of 6. The quantities ao and dao/dB will 
be taken equal to their respective expected values, namely, zero in 
each case; and Apo will be set equal to its steady-state value, namely, 
1/(1 — A). We will take vo = 0, So( Bo) = 0.625/[eo(1 — A)], and Ro = 
20.0/eo. 

The above starting values imply that at t = 0 the mean and the 
variance of 8 are respectively —0.6 and 0.063. The variance of the 
uniform distribution on the (—1, 0) interval is “Ye = 0.083. Since the 
feasible interval for B is smaller than (—1, 0), the variance of 0.063 
represents a fairly diffused initialization. 

Also, the above starting values imply that the mean and the variance 
of o” at t = 0 are 2.5 o%) and 0.625(020)”. Therefore, by the gamma 
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density assumption, the 95-percent confidence interval on o? is (1.2 070, 
4,28 0%), which is a very wide interval. 

The values of of and o3 implied by the above starting values are 
0.5 675 and 0.4 o%, respectively. 


Vill. ILLUSTRATIVE EXAMPLES 


To illustrate the properties of the quality evaluation plan we shall 
now present six examples. The first three examples are the simulated 
examples, while the latter three use real audit data. 

Example 1: Figure 3a shows the response of QEP to a sudden shift 
in the quality level. For the first ten periods the observed defect index 
fluctuates randomly around the fixed level 3.0. From the eleventh to 
the twentieth period the observed defect index is fixed at 1.0. In each 
period the expectancy at standard is 5.0. Notice that starting with the 
eleventh period the estimated mean level rapidly approaches the new 
mean level. Also, starting with the eleventh observation the product 
gets off the exception report. Figure 3b shows the corresponding results 
for qP. It is clear that in terms of both M; and 6, the response of QEP 
is quicker than the response of QmP. 

Example 2: Figure 4a displays the response of QEP to a linear trend 
in the quality level. As in the case of Example 1, for the first ten 
periods the observed defect index randomly fluctuates around the fixed 
level 3.0. From the eleventh to the twentieth period the observed 
defect index has a linear downtrend, as plotted. In all twenty periods 
the expectancy at standard is 5.0. Notice that both M, and @, follow 
the trend with a small lag. Also note that the QEP algorithm recognizes 
that the process has a drift rather than random fluctuation. Conse- 
quently, M, and 6, are very close while following the drift. Figure 4b 
shows the results of gmp for the same data. Here again, M, and #, 
follow the trend, but the lag is much larger. This is manifested in the 
fact that QEP gets the product off below normal in the seventeenth 
period while with qmp that happens a period later. 

Example 3: This example illustrates that QEP and qmP have similar 
behaviors when the defect index fluctuates about a fixed value for a 
long period of time. Figure 5a shows the results of gEP when the defect 
index fluctuates around the fixed level of 2.0, while Fig. 5b shows the 
results of amp with the same data. Note that both the methods declare 
below normals and alerts in the same periods. 

Example 4: Figure 6a gives the data for repaired remreed grids, 
rating class OCO038TT, for periods 7801 through 7904. The periods are 
numbered 1 through 12 in the figure. The QEP results are also shown 
in the figure. Similar results with Qmp are nlotted in Fig. 6b. In 
response in the drift in the quality we see that QEP attaches a heavier 
weight to the current data. Consequently, with QEP the mean level, M, 
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Fig. 3—Response to a sudden shift in the quality level for: (a) QEP, and (b) QMP. 


follows the drift more closely than with qmpP. In the period 5, recogniz- 
ing the drift, gEP takes the product off the exception report while MP 
still calls it an alert. Also period 7 is an alert according to QmP while, 
according to QEP, it is off the exception report. These differences 
between QEP and QmP are clearly seen to be the result of the fact that 
QEP exponentially discounts the past data, while QMP considers every 
observation in the six-period window to be equally important. 
Example 5: Figures 7a and 7b give the results of QEP and QmP, 
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Fig. 4—Response to a linear trend in the quality level for: (a) QEP, and (b) QMP. 


respectively, for the rating class OHO60CM, consisting of modular 
telephone chords. The periods covered by the chart are 7701-7808. As 
we saw in Example 4, QEP follows the drift more closely. In terms of 
the exception report, there are several differences. In periods 8, 15, and 
16 QmP declares below normal, while QEP calls it only an alert. In 


period 1Namp calle tt an alart while OFD does nat declare anv eycentinn 
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These differences are once again a result of the fact that QEP recognizes 
the drift and hence heavily discounts the past data. 
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Fig. 5—Response to a random fluctuation in the quality level for: (a) QEP, and (b) 
QMP. 


Example 6: The last example to be considered is the rating class 
MV104MJ. The results of both the methods are shown in Fig. 8. In 
this example the quality fluctuates more or less randomly about a 
fixed mean and, as expected, the two methods give comparable results. 

The average values of the weights w; and wz, and the equivalent 
expectancies e,; for the three audit examples are tabulated in Table I. 
Notice that average value of we; for OCO38TT and OHO60CM is 0.48 
compared with 0.55 for MV104MJ. This is a direct consequence of the 
fact that MV104MJ does not exhibit a drift while the others do. The 
high-frequency fluctuation about the mean function M; is depicted by 
wiz. Relative to the sampling variance (0.25/e,) OCO38TT exhibits a 
smaller fluctuation than OHO60CM. This concurs with the average 
values of w., for these rating classes. 
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Fig. 6—Results for the rating class OCO38TT, periods 7801 through 7904 for (a) QEP, 
and (b) QMpP. 


Through the preceding examples it is quite apparent that QEP and 
QMP could give somewhat different results. Now the key question is: 
Which method yields a more precise estimate of the unobserved “true 
defect index’? The only way to answer this question decisively is to 
take a 100-percent sample of a number of rating classes to find out the 
true defect indices and compare them with the QEP and QmpP results. 
This is obviously an impossible task. A feasible way to answer the 
question is by using the models to predict one step ahead and compare 
the mean-squared prediction errors. Note that M,_, is a predictor of J;. 
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Fig. 7—Results for the rating class OHO60CM, periods 7701 through 7808 for: (a) QEP, 
and (b) QMP. 


The mean-squared errors for the three audit data examples, viz. 
Examples 4, 5, and 6, are given in Table II. For the rating class 
OCO038TT we notice that the mean-squared prediction error (m.s.p.e.) 
of QMP is 33 percent larger than that of QEP; for OHO60CM the m.s.p.e. 
of QMP is 11 percent larger, and for MV104MJ the m.s.p.e. of QMP is 
only 3 percent larger. Thus, whenever there is a drift in the quality we 
may expect QEP to perform better than qmp, whereas if the quality 
fluctuates randomly around a fixed mean, both QMP and QEP would 
give similar results. 

Effect of truncation: In addition to the numerical examples cited 
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Fig. 8—Results for the rating class MV104M4, periods 7701 through 7808 for: (a) QEP, 
and (b) QMP. 


above, a limited numerical study was made with thirteen representa- 
tive rating classes. Each rating class had about fourteen periods of 
data. This represents a total of 182 test periods. Among these examples, 
truncation occurred in only 7 percent of the periods. Except for one 
case, all the truncations caused 67, = 0. These cases of truncation could 
be recognized broadly as situations where the variance of the observed 
defect indices was much smaller than that for the Poisson distribution. 
In each case of truncation the confidence intervals computed by QEP 
looked reasonable and comparable to those obtained by QMP, so we 
can tentatively conclude that the effect of truncation is negligible. Of 
course, an extensive trial of QEP may suggest some modifications to 
the truncation rules. 

One such modification may be to view the likelihood function of 6; 
and o; as the posterior-probability density function. Then the Bayes 
estimates of 6; and o7 may be used in place of the maximum likelihood 
estimates used in this paper. Because of the complexity of the feasible 
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Table I—Computed aep weights for the 
examples 


Average Value of 


Rating Class Wit Wet er 
OC038TT 0.83 0.48 3.7 
OH060CM 0.67 0.48 7.9 
MV104MJ 0.75 0.55 4.6 


Table Il—Comparison of the mean- 
squared prediction error 
Mean Squared Predic- 


tion Error 
Rating Class QEP QMP 
OCO038TT 0.69 0.92 
OH060CM 0.94 1.04 
MV104MJ 0.29 0.30 


region, computing Bayes estimates would involve extensive numerical 
effort, which may be unnecessary. 


IX. DISCUSSIONS 


In summary, the QEP model consists of two parts—the system model 
and the observation model. The system model states that the trans- 
formed true-defect index is equal to the process mean that follows the 
random walk model plus process fluctuation, which is statistically 
independent from period to period. The random walk model takes care 
of the process drift. The observation model states that the transformed 
observed defect index is equal to the transformed true-defect index 
plus sampling error with a known variance. The different parameters 
of the QEP model are estimated from the observed data by the recur- 
sive, exponentially discounted, maximum likelihood method. The suc- 
cessive transformed true defect indices and the process mean levels 
are then estimated by the adaptive Kalman filtering algorithm. 

From the derivation of the plan and the illustrative examples the 
following advantages of QEP are apparent: 

(t) The QEP model takes into account the time order of the 
observations, while in QMP the time order of the observations is 
ignored. 

(ti) The best estimate of the process mean level is obtained by an 
adaptive exponential smoothing procedure. This makes QEP more 
responsive to the shifts and drifts in the process level. This is evidenced 
by the lower mean-squared prediction error for the examples discussed 
in Section VIII. 
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(tit) The QMP model is a special case of the QEP model. However, 
the two algorithms are quite different. 

(iv) The computations are recursive. The entire past data are 
summarized by ten numbers. 

(v) The computational efforts of QEP and QmpP algorithms are 
comparable. 

In the light of the advantages listed above it is proposed that QEP be 
considered as a serious alternative to qmP for official rating. In prep- 
aration for using QEP it is suggested that it be tried on all rating classes 
for a number of rating periods, and the resulting exception reports 
carefully compared with those from the QmpP and the f-rate system. 
Such a study would aid us in fine tuning the starting conditions, 
quantifying the effect of truncation, and perhaps in making some other 
minor modifications for improving the performance of the QEP. 

For small expectancies, the square root transform of the Poisson 
distribution has a significantly different variance than 0.25, assumed 
in Section II. Since the audit samples can at times be very small it 
would be necessary to use the correct variances. A study of this aspect 
will be done in a later memorandum. 

The adaptive Kalman filtering methodology derived in this paper, 
with appropriate extensions and modifications, can be put to many 
other applications. In the field of quality control, Phadke’ had devel- 
oped a sequential empirical Bayes acceptance sampling plan. The 
adaptive Kaman filtering method developed in this paper would be 
particularly suited for updating the empirical prior distribution. An- 
other potential application is in combining the traditional X and R 
control charts into a single box chart. Here the adaptive Kalman filter 
would permit one to take into account serial correlation in the data as 
well as process drifts and shifts, and changes in the process variance. 
Yet another application is in adaptive time series forecasting. 
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APPENDIX A 
Derivation of the Kalman filter solution 


Let the conditional distribution of m;_; given data up to time ¢t — 1 
be normal with mean m,_; and variance q;-1, i.e., 


me-1|t -1~N(me-, Qt-1). (38) 


Eq. (2) expresses m; as a sum of two independent normal random 
variables m;-1 and vg. Since the mean and variance of vy are respec- 
tively 0 and o%;, it follows that 


melt —1~ N(r-a, 03: + Qr-1). (39) 
Substituting eq. (1) in eq. (3) we have 
Yi = me + Vue + 1, (40) 
which implies that 
Y;|m:~ N(mi, of: + 020). (41) 


In the Bayesian framework we may view eq. (39) as a prior distri- 
bution on m, and Y; as an observation of m with the distribution 
specified by eq. (41). Applying the Bayes theorem the distribution of 
m, conditional on data up to time ¢ is seen to be 


(mz = M1)” (Y, = mr)” 
f(m:|t) « exp| a 


o exp{— | ; (42) 
2a 
where 
Me = woithi-1 + (1 — wa) Yr, (4) 
Wee = (Ot + On) /(Oie + O22 + 03 + Grr); (5) 
and 
Qe = (1 — wx) (oi + ort). (6) 


From eq. (42) it can be inferred that the distribution of m; conditional 
on data up to time ¢ is normal with mean m; and variance q;. 

Equations (7) through (9), used for computing the conditional dis- 
tribution of ¢;, can be derived analogously as follows. First by substi- 
tuting eq. (2) in (1) we have 
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§: = Mer + ie + v223 (43) 
hence, 
&|t—1~ Nien, oi: + 03+ Qr-1). (44) 
From eq. (3) we have 
Y, | &~ Nk, One) : (45) 


Treating eq. (44) as a prior distribution for ¢; and applying the Bayes 
theorem, we readily obtain the distribution of ¢; conditional on data 
up to time ¢ as 


ae 2 a 2 
f(S|t) ce exp | = | 


2(o%, + 05: + Gr-1) 207, 
oc exp {- (Se = $e)" | ; (46) 
where 
§ = wrrwortra + (1 — wor) Ye, (7) 
WW = One| (Ore + oi; + 03 + Gi-1), (8) 
and 
Pt = (1 — wiwor) 074. (9) 


Thus, the conditional distribution of ¢ on data up to time ¢ is normal 
with mean ¢ and variance p;. Equations (7) through (9) form the 
desired recursive equations for computing ¢; and p:. 


APPENDIX B 
Estimation of B and o? 


Given the observed transformed defect indices Yo, Yi, Yo, -+- , Yn 
one can compute Z; = Y, — Y;-: fort = 1, --- , n. The Z; series follows 
MA(1) model given by eq. (13). The exponentially discounted proba- 
bility density function of a), --- , a: is given by 


Aj 
> 





P(ai, +++, a|o”) = [] (47) 


J=1 


1 
exp(—a}/204) 
V2a0° ; 
where A; = A‘ ’. Thus, the exponentially discounted probability density 
function of Z;, --- , Z; conditional on the knowledge of ao, B, and o” is 
given by 





eo 1 Wy 
Zi, +++, Z| ao, B, 02) = —a2/207)| , (48 
A | ao, B, 0) Tl | ow i/20") | (48) 
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where a; is related to Z; and f via the recursion relation in eq. (13), i.e., 


aj = Zj — Baj-r. (49) 
Thus, the conditional, exponentially discounted, log-likelihood func- 
tion is 


LAB, 0°) = —4(ArIn &° + S,/o”), (50) 
where 
A:= : aes (51) 
~ and i 
Si = > Na}. (52) 
pm 


By differentiating eq. (50) with respect to 6 and o” and equating the 
derivatives to zero, it can be shown that — He maximum at (Bi, 6 67), 
where f; is the minimum point of S,(8) and 6 = S,(B:) /At. 
In the neighborhood of a point Bo, we can ae a; by the 
linear function: 
da;(B) 





aj(B) = a(Bo) + (B — Bo) dB (53) 
Substituting this approximation in eq. (52) we have 
S.(B) ~ S:(Bo) + (B — Bo) + (8B — Bo)’R:, (54) 


where », R; and S;(8o) obey the recursion relations shown in eqs. (20) 
through (26). It is easy to verify that B,, given by eq. (18) minimizes 
the approximate S,(8) of eq. (54), and eq. (19) gives G7. 

The matrix of second partial derivatives of L; is 


R, 0 

20° 
A: |’ 
20% 


so by the Fisher-information theory, the estimates 2, and 6? are 
uncorrelated and their approximate variances are as given by eqs. (27) 
and (28). 

The above recursive procedure also has a Bayesian interpretation, 
as given by Phadke.° 


APPENDIX C 
Summary of the Formulae 
C.1 Initial conditions 
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go= 0.134 


Yo = 1.0 
Bo = —0.6 
A= 0.95 
dp 
€o = @1 
0.625 
So( Bo) = PT Bea ty 
Vo = 0 
Ro = 20.0/eo 
Ao = 1/(1—A) 


020 = 0.25/e0 


C.2 Recursive formulae 





I, = xfer 
Y.= vi 
Z:= Yi- Yi-r 
O71 = 0.25/e; 
a= LZ: — Boar-1 
da ee dar-1 
dB ap 
S:(Bo) = ASi-1(Bo) + a? 
da 
ve = Am1 + 2ay ah 
n= anes +2(H) 
A= \A--1 +1 
Bi = Bo — Ri’, -1<=fr=0 


S:(Bi) = S:(Bo) + (Bi — Bo)ve + Y2(BF — Bo)’R: 
of = Si(Bi)/Ae 
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oy = A o2n-1 + (1—A)or, 

6 = Biot? — on, gir 20 

6% = (1 + BF)?o?? 

If — Biot? — 07, = 0, then 
p= Bt &=ot. 

If — Byot? — 02, <0, then 


~ —(2 + Ge/or) + V(2 + 6%/ox) — 4 ; 


B: = 5) nd 
A ZT ,A 
G7 = —ont/ Br. 
2 
é oo 
Wt “A272 
On + Ont 
AQ 2 
P Oit + One 
Wo = 


the = Gorttua+ (1 — bo) Yr 
& = Out + (1 — Ou) Yr 
(268[1 + Go(1 + 28) P/Rd 


Vicon) = —t L26H1B + (1+ Be + BiBaiP*/Ad 
(Fie + G3 + Om + Gea)” ‘ 


[268(1 + 28,)°53,63,/ Re] 
+ [264(1 + B, + B2)633,/ Ai] 1 
Da a Bees ae V(wirwar) = — 
(Gite + G2¢ + One + Qr-1) 12 


a= (1- (er) (Gite + Ont) + (Y¥; — mMra)’V (wee) 


1 
V (a2) = 2 
V (1222) = 


p= (1- O1rGor) One + (Y; — rra)’V (wire) 


C.3 Points for the box chart 
Current defect index ae 


Best estimate of the defect index: 6; = ie. 


The mean level :M,= m? 

99% quantile :Q: = [max(& — 2.326 Vp, 0)? 
95% quantile : Qo = [max(& — 1.645 Vp, 0)? 
5% quantile > Qs = (& + 1.645 Vp,)? 

1% quantile Qs = (& + 2.326 Vp,)? 
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